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Description 

[0001] The present invention relates to novel proteins and processes for obtaining them. These proteins are capable 
of inducing cartilage and bone formation. 

5 

Background 

[0002] . Bone is a highly specialized tissue characterized by an extensive matrix structure formed of fibrous bundles 
of the protein collagen, and proteoglycans, noncollagenous proteins, lipids and acidic proteins. The processes of bone 
io formation and renewal/repair of bone tissue, which occur continuously throughout life, are performed by specialized 
cells. Normal embryonic long bone development is preceded by formation of a cartilage model. Bone growth is pre- 
sumably mediated by "osteoblasts" (bone-forming cells), while remodeling of bone is apparently accomplished by the 
joint activities of bone-resorbing cells, called "osteoclasts" and osteoblasts. A variety of osteogenic, cartilage-inducing 
and bone inducing factors have been described. See, e.g. European patent applications 148,155 and 169,016 for 
is discussions thereof. 

Brief Description of the Invention 

[0003] The present invention provides novel proteins in purified form. Specifically, four of the novel proteins are 
20 designated BMP-1, BMP-2 Class I (or BMP-2), BMP-3, and BMP-2 Class II (or BMP-4) wherein BMP is bone morpho- 
genic protein. These proteins are characterized by peptide sequences the same as or substantially homologous to 
amino acid sequences illustrated in Tables It through VIII below. They are capable of inducing bone formation at a 
predetermined site. These bone inductive factors are further characterized by biochemical and biological characteristics 
including activity at a concentration of 10, to lOOOng/gram of bone in an jn vivo rat bone formation assay described 
25 below. Proteins of this invention may be encoded by the DNA sequences depicted in the Tables or by sequences 

capable of hybridizing thereto and coding for polypeptides with bone growth factor biological properties or other vari- 
ously modified sequences demonstrating such properties. 

[0004] One of the proteins of the invention is designated BMP-1 . A portion of the human BMP-1 or hBMP-1 is char- 
acterized by the same or substantially the same peptide sequence as that of amino acid #1 through amino acid #37 
30 of Table V, below which represents a genomic hBMP-1 fragment or amino acid #1 through amino acid #730 of Table 

VI which represents the hBMP-1 cDNA. hBMP-1 or a related bone inductive factor may be further characterized by at 
least a portion of these sequences. These peptide sequences are encoded by the same or substantially the same DNA 
sequence, as depicted in nucleotide #3440 through nucleotide #3550 of Table V and in nucleotide #36 through nucle- 
otide #2225 of Table VI, respectively. These hBMP-1 polypeptides are further characterized by the ability to induce 
35 bone formation. hBMP-1 demonstrates activity in an in vivo rat bone formation assay at a concentration of 1 0 to 1 000ng/ 
gram of bone. 

[0005] The homologous bovine growth factor of the invention, designated bBMP-1, is characterized by a peptide 
sequence containing the same or substantially the same sequence as that of amino acid #1 through amino acid #37 
of Table II below which represents a genomic bBMP-1 fragment. This peptide sequence Is encoded by the same or 
to substantially the same DNA sequence as depicted in nucleotide #294 through nucleotide #404 of Table II. The bovine 
peptide sequence identified in Table II below is also 37 amino acids in length. bBMP-1 is further characterized by the 
ability to induce bone formation. 

[0006] Another bone inductive protein composition of the invention is designated BMP-2 Class I (or BMP-2). It is 
characterized by at least a portion of a peptide sequence the same or substantially the same as that of amino acid #1 
45 through amino acid #396 of Table VII which represents the cDNA hBMP-2 Class I. This peptide sequence is encoded 
by the same or substantially the same DNA sequence, as depicted in nucleotide #356 through nucleotide #1543 of 
Table VII. The human peptide sequence identified in Table VII is 396 amino acids in length. hBMP-2 or related bone 
inductive proteins may also be characterized by at least a portion of this peptide sequence. hBMP-2 Class I is further 
characterized by the ability to induce bone formation. 

so [0007] The homologous bovine bone inductive protein of the invention designated bBMP-2 Class I (or bBMP-2), has 
a DNA sequence identified in Table ill below which represents the genomic sequence. This bovine DNA sequence has 
a prospective 129 amino acid coding sequence followed by approximately 205 nucleotides (a presumptive 3' non- 
coding sequence). bBMP-2, Class I is further characterized by the ability to induce bone formation. A further bone 
inductive protein composition of the invention is designated BMP-2 Class II or BMP-4. The human protein hBMP-2 
55 Class II (or hBMP-4) is characterized by at least a portion of the same or substantially the same peptide sequence 
between amino acid #1 through amino acid #408 of Table VIII, which represents the cDNA of hBMP-2 Class II. This 
peptide sequence is encoded by at least a portion of the same or substantially the same DNA sequence as depicted 
in nucleotide #403 through nucleotide #1626 of Table VIII. This factor is further characterized by the ability to induce 
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bone formation. 

[0008] Still another bone inductive factor of the invention, BMP-3, is represented by the bovine homolog bBMP-3. 
bBMP-3 is characterized by the DNA sequence and amino acid sequence of Table IV A and B which represents the 
bovine genomic sequence. It is characterized by at least a portion of a peptide sequence the same or substantially the 
5 same as amino acid #1 through amino acid #1 75 of Table IV A and B. BMP-3 is further characterized by the ability to 
induce bone formation. The bovine.factor may be employed as a tool for obtaining the analogous human BMP-3 protein 
or other mammalian bone inductive proteins. The proper characterization of this bovine bone inductive factor provides 
the essential "starting point" for the method employing this sequence. The method, employing techniques known to 
those skilled in the art of genetic engineering, involves using the bovine DNA sequence as a probe to screen a human 
10 genomic or cDNA library; and identifying the DNA sequences which hybridize to the probes. A clone with a hybridizable 
sequence is plaque purified and the DNA isolated therefrom, subcloned and subjected to DNA sequence analysis. 
Thus another aspect of this invention is a human protein hBMP-3, produced by this method. 

[0009] Another aspect of the invention provides pharmaceutical compositions containing a therapeutically effective 
amount of one or more bone growth factor polypeptides according to the invention in a pharmaceutically acceptable 
15 vehicle. These compositions may further include other therapeutically useful agents. They may also include an appro- 
priate matrix for delivering the proteins to the site of the bone defect and for providing a structure for bone growth. 
These compositions may be employed in methods for treating a number of bone defects and periodontal disease. 
These methods, according to the invention, entail administering to a patient needing such bone formation an effective 
amount of at least one of the novel proteins BMP-1 , BMP-2 Class I, BMP-2 Class-ll, and BMP-3 as described herein. 
20 [0010] Still a further aspect of the invention are DNA sequences coding on expression for a human or bovine polypep- 

tide having the ability to induce bone formation. Such sequences include the sequence of nucleotides in a 5' to 3' 
direction illustrated in Tables II through VIII. Alternatively, a DNA sequence which hybridizes under stringent conditions 
with the DNA sequences of Tables II - VIII or a DNA sequence which hybridizes under non-stringent conditions with 
the illustrated DNA sequences and which codes on expression for a protein having at least one bone growth factor 
25 biological property are included in the present invention. Finally, allelic or other variations of the sequences of tables 
II through VIII, whether such nucleotide changes result in changes in the peptide sequence or not, are also included 
in the present invention. 

[0011] Still a further aspect of the invention is a vector containing a DNA sequence as described above in operative 
association with an expression control sequence. Such vector may be employed in a novel process for producing a 
30 bone growth factor polypeptide in which a cell line transformed with a DNA sequence encoding expression of a bone 
growth factor polypeptide in operative association with an expression control sequence therefor, is cultured. This 
claimed process may employ a number of known cells as host cells for expression of the polypeptide. Presently pre- 
ferred celi lines are mammalian cell lines and bacterial cells. 

[0012] Other aspects and advantages of the present invention will be apparent upon consideration of the following 
35 detailed description and preferred embodiments thereof. 

Detailed Description of the Invention 

[001 3] The proteins of the present invention are characterized by amino acid sequences or portions thereof the same 
40 as or substantially homologous to the sequences shown in Tables II - VIII below. These proteins are also characterized 
by the ability to induce bone formation. 

[0014] The bone growth factors provided herein also include factors encoded by the sequences similar to those of 
Tables II - VIII, but into which modifications are naturally provided (e.g. allelic variations in the nucleotide sequence 
which may result in amino acid changes in the polypeptide) or deliberately engineered. For example, synthetic polypep- 
45 tides may wholly or partially duplicate continuous sequences of the amino acid residues of Tables II - VIII. These 
sequences, by virtue of sharing primary, secondary, or tertiary structural and conformational characteristics with bone 
growth factor polypeptides of Tables 1 1 - VI 1 1 may possess bone growth factor biological properties in common therewith. 
Thus, they may be employed as biologically active substitutes for naturally-occurring bone growth factor polypeptides 
in therapeutic processes. 

50 [0015] Other specific mutations of the sequences of the bone growth factors described herein involve modifications 

of one or both of the glycosylation sites. The absence of glycosylation or only partial glycosylation results from amino 
acid substitution or deletion at one or both of the asparagine-linked glycosylation recognition sites present in the se- 
quences of the bone growth factors shown in Tables H-VIII._ The asparagine-linked glycosylation recognition sites 
comprise tripeptide sequences which are specifically recognized by appropriate cellular glycosylation enzymes. These 
55 tripeptide sequences are either asparagine-X-threonine or asparagine-X-serine, where X is usually any amino acid. A 

variety of amino acid substitutions or deletions at one or both of the first or third amino acid positions of a glycosylation 
recognition site (and/or amino acid deletion at the second position) results in non-glycosylation at the modified tripeptide 
sequence. 
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[0016] The present invention also encompasses the novel DNA sequences, free of association with DNA sequences 
encoding other proteinaceous materials, and coding on expression for bone growth factors. These DNA sequences 
include those depicted in Tables II - VIII in a 5' to 3' direction and those sequences which hybridize under stringent 
hybridization conditions [see, T. Maniatis et al, Molecular Cloning (A Laboratory Manual) , Cold Spring Harbor Laboratory 
5 (1982), pages 387 to 389] to the DNA sequences of Tables II - VIII. 

[0017] DNA sequences which hybridize to the sequences of Tables II - VIII under relaxed hybridization conditions 
and which code on expression for bone growth factors having bone growth factor biological properties also encode 
bone growth factors of the invention. For example, a DNA sequence which shares regions of significant homology, e. 
g., sites of glycosylation or disulfide linkages, with the sequences of Tables II - VIII and encodes a bone growth factor 
10 having one or more bone growth factor biological properties clearly encodes a member of this novel family of growth 
factors, even if such a DNA sequence would not stringently hybridize to the sequence of Tables II - VIII. 

[0018] Similarly, DNA sequences which code for bone growth factor polypeptides coded for by the sequences of 
Tables II - VIII, but which differ in codon sequence due to the degeneracies of the genetic code or allelic variations 
(naturally-occurring base changes in the species population which may or may not result in an amino acid change) 
is also encode the novel growth factors described herein. Variations in the DNA sequences of Tables II -VIII which are 
caused by point mutations or by induced modifications to enhance the activity, half-life or production of the polypeptides 
encoded thereby are also encompassed in the invention. 

[0019] Another aspect of the present invention provides a novel method for producing the novel osteoinductive fac- 
tors. The method of the present invention involves culturing a suitable cell or cell line, which has been transformed 
20 with a DNA sequence coding on expression for a novel bone growth factor polypeptide of the invention, under the 
control of known regulatory sequences. Suitable cells or cell lines may be mammalian cells, such as Chinese hamster 
ovary cells (CHO). The selection of suitable mammalian host cells and methods for transformation, culture, amplifica- 
tion, screening and product production and purification are known in the art. See, e.g., Gething and Sambrook, Nature, 
293:620-625 (1 981 ), or alternatively, Kaufman et al, Mol. Cell. Biol. , 5(7):1 750-1 759 (1 985) or Howley et al, U.S. Patent 
25 4,419,446. Another suitable mammalian cell line, which is described in the accompanying examples, is the monkey 

COS-1 cell line. A similarly useful mammalian cell line is the CV-1 cell line. 

[0020] Bacterial cells are suitable hosts. For example, the various strains of E. coli (e.g., HB1 01, MCI 061) are well- 
known as host cells in the field of biotechnology. Various strains of B. subtilis , Pseudomonas, other bacilli and the like 
may also be employed in this method. 

30 [0021 ] Many strains of yeast cells known to those skilled in the art are also available as host cells for expression of 

the polypeptides of the present invention. Additionally, where desired, insect cells may be utilized as host cells in the 
method of the present invention. See, e.g. Miller et al, Genetic Engineering , 8:277-298 (Plenum Press 1986) and 
references cited therein. 

[0022] Another aspect of the present invention provides vectors for use in the method of expression of these novel 
35 osteoinductive polypeptides. Preferably the vectors contain the full novel DNA sequences described above which code 
for the novel factors of the invention. Additionally the vectors also contain appropriate expression control sequences 
permitting expression of the bone inductive protein sequences. Alternatively, vectors incorporating modified sequences 
as described above are also embodiments of the present invention and useful in the production of the bone inductive 
proteins. The vectors may be employed in the method of transforming cell lines and contain selected regulatory se- 
40 quences in operative association with the DNA coding sequences of the invention which are capable of directing the 
replication and expression thereof in selected host cells. Useful regulatory sequences for such vectors are known to 
one of skill in the art and may be selected depending upon the selected host cells. Such selection is routine and does 
not form part of the present invention. 

[0023] A protein of the present invention, which induces bone growth in circumstances where bone is not normally 
45 formed, has application in the healing of bone fractures. An osteogenic preparation employing one or more of the 
proteins of the invention may have prophylactic use in closed as well as open fracture reduction and also in the improved 
fixation of artificial joints. De novo bone formation induced by an osteogenic agent contributes to the repair of congenital, 
trauma induced, or oncologic resection induced craniofacial defects, and also is useful in cosmetic plastic surgery. An 
^osteogenic factor of the invention may be valuable in the treatment of periodontal disease, and in other tooth repair 
50 processes. Such agents may provide an environment to attract bone-forming cells, stimulate growth of bone-forming 
cells or induce differentiation of progenitors of bone-forming cells. Of course, the proteins of the invention may have 
other therapeutic uses. 

[0024] A further aspect of the invention is a therapeutic method and composition for repairing fractures and other 
conditions related to bone defects or periodontal diseases. Such a composition comprises a therapeutically effective 
55 amount of at least one of the bone inductive factor proteins of the invention. The bone inductive factors according to 
the present invention may be present in a therapeutic composition in admixture with a pharmaceutically acceptable 
vehicle or matrix. Further therapeutic methods and compositions of the invention comprise a therapeutic amount of a 
bone inductive factor of the invention with a therapeutic amount of at least one of the other bone inductive factors of 



4 




EP 1 254 956 A2 



the invention. 

Additionally, the proteins according to the present invention or a combination of the proteins of the present invention 
may be co-administered with one or more different osteoinductive factors with which it may interact. Further, the bone 
inductive proteins may be combined with other agents beneficial to the treatment of the bone defect in question. Such 
agents include, but are not limited to various growth factors. The preparation of such physiologically acceptable protein 
compositions, having due regard to pH, isotonicity, stability and the like, is within the skill of the art. 

[0025] In particular, BMP-1 may be used individually in a composition. BMP-1 may also be used in combination with 
one or more of the other proteins of the invention. BMP-1 and BMP-2 Class I may be used in combination. BMP-1 and 
BMP-2 Class II may also be used in combination. BMP-1 and BMP-3 may be used in combination. Furthermore, BMP- 
1 may be used in combination with two or three of the other proteins of the invention. For example, BMP-1 , BMP-2 
Class I, and BMP-2 Class II may be combined. BMP-1 may also be combined with BMP-2 Class I, and BMP-3. Further, 
BMP-1 may be combined with BMP-2 Class II, and BMP-3. BMP-1 , BMP-2 Class I, BMP-2 Class II, and BMP-3 may 
be combined. 

[0026] BMP-2 Class I may be used individually in a pharmaceutical composition. BMP-2 Class I may also be used 
in combination with one or more of the other proteins of the invention. BMP-2 Class I may be combined with BMP-2 
Class II. It may also be combined with BMP-3. Further BMP-2 Class I may be combined with BMP-2 Class II and BMP-3. 
[0027] BMP-2 Class II may be used individually in pharmaceutical composition. In addition, it may be used in com- 
bination with other proteins as identified above. Further it may be used in combination with BMP-3. 

[0028] BMP-3 may be used individually in a composition. It may further be used in the various combinations identified 
above. 

[0029] The therapeutic method includes locally administering the composition as an implant or device. When admin- 
istered, the therapeutic composition for use in this invention is, of course, in a pyrogen-free, physiologically acceptable 
form. Further, the composition may desirably be encapsulated or injected in a viscous form for delivery to the site of 
bone damage. Preferably, the bone growth inductive factor composition would include a matrix capable of delivering 
the bone inductive factor to the site of bone damage, providing a structure for the developing bone and cartilage and 
optimally capable of being resorbed into the body. Such matrices may be formed of other materials presently in use 
for other implanted medical applications. 

[0030] The choice of material is based on, for example, biocompatibility, biodegradability, mechanical properties, 
cosmetic appearance and interface properties. Similarly, the application of the osteoinductive factors will define the 
appropriate formulation. Potential matrices for the osteoinductive factors may be biodegradable and chemically defined, 
such as, but not limited to calcium sulfate, tricalciumphosphate, hydroxyapatite, polylactic acid, polyanhydrides; bio- 
degradable and biologically well defined, such as bone or dermal collagen, other pure proteins or extracellular matrix 
components; nonbiodegradable and chemically defined, such as sintered hydroxyapatite, bioglass, aluminates, or other 
ceramics; or combinations of any of the above mentioned types of material, such as polylactic acid and hydroxyapatite 
or collagen and tricalciumphosphate. The bioceramics might also be altered in composition, such as in calcium-alumi- 
nate-phosphate and processing to alter for example, pore, size, particle size, particle shape, and biodegradability. 
[0031] The dosage regimen will be determined by the attending physician considering various factors which modify 
the action of such a growth factor, e.g. amount of bone weight desired to be. formed, the site of bone damage, the 
condition of the damaged bone, the patient's age, sex, and diet, the severity of any infection, time of administration 
and other clinical factors. The dosage may vary with the type of matrix used in the reconstitution and the composition 
of BMP's. The addition of other known growth factors, such as IGF 1 (insulin like growth factor 1 ), to the final composition, 
may also effect the dosage. Generally, the dosage regimen should be in the range of approximately 1 0 to 1 0 6 nanograms 
of protein per gram of bone weight desired. Progress can be monitored by periodic assessment of bone growth and/ 
or repair, e.g. x-rays. Such therapeutic compositions are also presently valuable for veterinary applications due to the 
lack of species specificity in bone inductive factors. Particularly domestic animals and thoroughbred horses in addition 
to humans are desired patients for such treatment with the bone inductive factors of the present invention. . 

[0032] The following examples illustrate practice of the present invention in recovering and characterizing the bovine 
proteins and employing them to recover the human proteins, obtaining the human proteins and in expressing the 
proteins via recombinant techniques. 

EXAMPLE I 



Isolation of Bovine Bone Inductive Factor 

55 [0033] Ground bovine bone powder (20-120 mesh, Helitrex) is prepared according to the procedures of M. R. Urist 

et al., Proc. Natl Acad. Sci USA , 70:3511 (1973) with elimination of some extraction steps as identified below. Ten kgs 
of the ground powder is demineralized in successive changes of 0.6N HCI at 4°C over a 48 hour period with vigorous 
stirring. The resulting suspension is extracted for 1 6 hours at 4°C with 50 liters of 2M CaCI 2 and 1 0mM ethylenediamine- 
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tetraacetic acid [EDTA], and followed by extraction for 4 hours in 50 liters of 0.5M EDTA. The residue is washed three 
times with distilled water before its resuspension in 20 liters of 4M guanidine hydrochloride [GuCI], 20mM Tris (pH 7.4), 
ImM N-ethylmaleimide, ImM iodoacetamide, ImM phenylmethylsulfonyl fluorine as described in Clin. Orthop. Rel. 
Res., 171: 213 (1982). After 16 to 20 hours the supernatant is removed and replaced with another 10 liters of GuCI 
5 buffer. The residue is extracted for another 24 hours. 

[ 0034 ] The crude GuCI extracts are combined, concentrated approximately 20 times on a PeIJicon apparatus with a 
1 0,000 molecular weight cut-off membrane, and then dialyzed in 50mM Tris, 0. 1 M NaCI, 6M urea (pH7,2), the starting 
buffer for the first column. After extensive dialysis the protein is loaded on a 4 liter DEAE cellulose column and the 
unbound fractions are collected. 

io [ 0035 ] The unbound fractions are concentrated and dialyzed against 50mM NaAc, 50mM NaCI (pH 4.6) in 6M urea. 
The unbound fractions are applied to a carboxymethyl cellulose column. Protein not bound to the column is removed 
by extensive washing with starting buffer, and the bone inductive factor containing material desorbed from the column 
by 50mM NaAc, 0.25mM NaCI, 6M urea (pH 4.6). The protein from this step elution is concentrated 20- to 40- fold, 
then diluted 5 times with 80mM KP0 4 , 6M urea (pH6.0). The pH of the solution is adjusted to 6.0 with 500mM K 2 HP0 4 . 
15 The sample is applied to an hydroxylapatite column (LKB) equilibrated in 80mM KP0 4 , 6M urea (pH6.0) and all unbound 
protein is removed by washing the column with the same buffer. Bone inductive factor activity is eluted with lOOmM 
KP0 4 (pH7.4) and 6M urea. 

[ 0036 ] The protein is concentrated approximately 10 times, and solid NaCI added to a final concentration of 0.1 5M. 
This material is applied to a heparin - Sepharose column equilibrated in 50mM KP0 4 , 1 50mM NaCI, 6M urea (pH7.4). 
20 After extensive washing of the column with starting buffer, a protein with bone inductive factor, activity is eluted by 
50mM KP0 4 , 700mM NaCI, 6M urea (pH7.4). This fraction is concentrated to a minimum volume, and 0.4ml aliquots 
are applied to Superose 6 and Superose 1 2 columns connected in series, equilibrated with 4M GuCI, 20mM Tris (pH7.2) 
and the columns developed at a flow rate of 0.25ml/min. The protein demonstrating bone inductive factor activity has 
a relative migration corresponding to approximately 30,000 dalton protein. 

25 [ 0037 ] The above fractions are pooled, dialyzed against 50mM NaAc, 6M urea (pH4.6), and applied to a Pharmacia 

MonoS HR column. The column is developed with a gradient to 1.0M NaCI, 50mM NaAc, 6M urea (pH4.6). Active 
fractions are pooled and brought to pH3.0 with 1 0% trifluoroacetic acid (TFA). The material is applied to a 0.46 x 25cm 
Vydac C4 column in 0.1 % TFA and the column developed with a gradient to 90% acetonitrile, 0.1 % TFA (31 .5% ace- 
tonitrile, 0.1% TFA to 49.5% acetonitrile, 0.1% TFA in 60 minutes at Iml per minute). Active material is eluted at ap- 
30 proximately 40-44% acetonitrile. Aliquots of the appropriate fractions are iodinated by one of the following methods: 
P. J. McConahey et al, Int. Arch. Allergy , 29:185-189 (1966); A. E. Bolton et al, Biochem J ., 133:529 (1973); and D. F. 
Bowen-Pope, J, Biol. Chem. , 237:51 61 (1 982). The iodinated proteins present in these fractions are analyzed by SDS 
gel electrophoresis and urea Triton X 100 isoelectric focusing. At this stage, the bone inductive factor is estimated to 
be approximately 10-50% pure. 

35 

EXAMPLE II 

Characterization of Bovine Bone inductive Factor 
40 A. Molecular Weight 

[ 0038 ] Approximately 20ug protein from Example I is lyophilized and redissolved in 1 X SDS sample buffer. After 1 5 
minutes of heating at 37°C, the sample is applied to a 15% SDS polyacrylamide gel and then electrophoresed with 
cooling. The molecular weight is determined relative to prestained molecular weight standards (Bethesda Research 
45 Labs). Immediately after completion, the gel lane containing bone inductive factor is sliced into 0.3cm pieces. Each 
piece is mashed and 1 .4ml of 0.1% SDS is added. -The samples are shaken gently overnight at room temperature to 
elute the protein. Each gel slice is desalted to prevent interference in the biological assay. The supernatant from each 
sample is acidified to pH 3.0 with 1 0% TFA, filtered through a 0.45 micron membrane and loaded on a 0.46cm x 5cm 
C4 Vydac column developed with a gradient of 0.1% TFA to 0.1% TFA, 90% CH 3 CN. The appropriate bone inductive 
50 factor - containing fractions are pooled and reconstituted with 20mg rat matrix. In this gel system, the majority of bone 
inductive factor fractions have the mobility of a protein having a molecular weight of approximately 28,000 - 30,000 
daltons. 

B. Isoelectric Focusing 

55 

[ 0039 ] The isoelectric point of bone inductive factor activity is determined in a denaturing isoelectric focusing system. 
The Triton XI 00 urea gel system (Hoeffer Scientific) is modified as follows: 1 ) 40% of the ampholytes used are Servalyte 
3/10; 60% are Servalyte 7-9. 2) The catholyte used is 40mM NaOH. Approximately 20ug of protein from. Example I is 
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lyophilized, dissolved in sample buffer and applied to the isoelectrofocusing gel. The gel is run at 20 watts, 10°C for 
approximately 3 hours. At completion the lane containing bone inductive factor is sliced into 0.5 cm slices. Each piece 
is mashed in 1.0ml 6M urea, 5mM Tris (pH 7.8) and the samples agitated at room temperature. The samples are 
acidified, filtered, desalted and assayed as described above. The major portion of activity as determined in the assay 
5 described in Example III migrates in a manner consistent with a pi of 8.8 - 9.2. 

C. Subunit Characterization 

[0040] The subunit composition of bone inductive factor is also determined. Pure bone inductive factor is isolated 
10 from a preparative 1 5% SDS gel as described above. A portion of the sample is then reduced with 5mM DTT in sample 
buffer and re-electrophoresed on a 15% SDS gel. The approximately 30kd protein yields two major bands at approx- 
imately 20kd and 18kd, as well as a minor band at 30kd. The broadness of the two bands indicates heterogeneity 
caused most probably by glycosylation, other post translational modification, proteolytic degradation or carbamylation. 

15 EXAMPLE III 

Biological Activity of Bone Inductive Factor 

[0041] A rat bone formation assay according to the.general procedure of Sampath and Reddi, Proc. Natl. Acad. Sci. 
20 U S A > 80:6591-6595 (1983) is used to evaluate the osteogenic activity of the bovine bone inductive factor of the 
present invention obtained in Example I. This assay can also be used to evaluate bone inductive factors of other 
species. The ethanol precipitation step is replaced by dialyzing the fraction to be assayed against water. The solution 
or suspension is then redissolved in a volatile solvent, e.g. 0.1 - 0.2 % TFA, and the resulting solution added to 20mg 
of rat matrix. This material is frozen and lyophilized and the resulting powder enclosed in #5 gelatin capsules. The 
25 capsules are implanted subcutaneously in the abdominal thoracic area of 21 - 49 day old male long Evans rats. The 
implants are removed after 7-14 days. Half of each implant is used for alkaline phosphatase analysis [See, A. H. 
Reddi et al., Proc. Natl Acad Sci. , 69:1 601 (1972)] and half is fixed and processed for histological analysis. Routinely, 
lum glycolmethacrylate sections are stained with Von Kossa and acid fuschin to detect new bone mineral. Alkaline 
phosphatase, an enzyme produced by chondroblasts and osteoblasts in the process of matrix formation, is also meas- 
30 ured. New cartilage and bone formation often correlates with alkaline phosphatase levels. Table I below illustrates the 
dose response of the rat matrix samples including a control not treated with bone inductive factor. 

TABLE 1 



35 



40 



[0042] The bone or cartilage formed is physically confined to the space occupied by the matrix. Samples are also 
45 analyzed by SDS gel electrophoresis and isoelectric focusing as described above, followed by autoradiography. Anal- 
ysis reveals a correlation of activity with protein bands at 28 - 30kd and a pi 9.0. An extinction coefficient of 1 OD/mg- 
cm is used as an estimate for protein and approximating the purity of bone inductive factor in a particular fraction. In 
the in vivo rat bone formation assays on dilutions as described above, the protein is active in vivo at 1 0 to 200ng protein/ 
gram bone to probably greater than lug protein/gram bone. 

50 EXAMPLE IV 

Bovine Bone Inductive Factor Protein Composition 

55 [0043] The protein composition of Example IIA of molecular weight 28 - 30kd is reduced as described in Example 

IIC and digested with trypsin. Eight tryptic fragments are isolated by standard procedures having the following amino 
acid sequences: 



Protein* Implanted ug 


Cartilage 


Aik. Phos.u/I 


7.5 


2 


Not done 


2.5 


3 


445.7 


0.83 


3 


77.4 


0.28 


0 


32.5 


0.00 


0 


31.0 



*At this stage the bone inductive factor is approximately 10*15% pure. 
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Fragment 1:AAFLGDIALDEEDLG 
Fragment 2: AFQVQQAADL 
Fragment 3:NYQDMVVEG 
Fragment 4:STPAQDVSR 
5 Fragment 5: N Q E A L R 

Fragment 6:LSEPDPSHTLEE 

Fragment 7: F D A Y Y 

Fragment 8:LKPSN?ATIQSIVE 

io [ 0044 ] A less highly purified preparation of protein from bovine bone is prepared according to a purification scheme 
similar to that described in Example I. The purification basically varies from that previously described by omission of 
the DE-52 column, the CM cellulose column and the mono S column, as well as a reversal in the order of the hydrox- 
ylapatite and heparin sepharose columns. Briefly, the concentrted crude 4 M extract is brought to 85% final concen- 
tration of ethanol at 4 degrees. The mixture is then centrifuged, and the precipitate redissolved in 50 mM Tris, 0.15 M 
15 NaCI, 6.0 M urea. This material is then fractionated on Heparin Sepharose as described. The Heparin bound material 
is fractionated on hydroxyapatite as described. The active fractions are pooled, concentrated, and fractionated on a 
high resolution gel filtration (TSK 30000 in 6 M guanidinium chloride, 50 mM Tris, pH 7.2). The active fractions are 
pooled, dialyzed against 0.1% TFA, and then fractionated on a C4 Vydac reverse phase column as described. The 
preparation is reduced and electrophoresed on an acrylamide gel. The protein corresponding to the 1 8K band is eluted 
20 and digested with trypsin. Tryptic fragments are isolated having the following amino acid sequences: 

Fragment 9:SLKPSN H ATIQS? V 
Fragment 10:SFDAYYCS?A 
Fragment 1 1 : V Y P N M T V E S C A 
25 Fragment 12:VDFADI?W 

[0045] Tryptic Fragments 7 and 8 are noted to be substantially the same as Fragments 10 and 9, respectively. 

A. bBMP-1 

30 

[ 0046 ] Probes consisting of pools of oligonucleotides (or unique oligonucleotides) are designed according to the 
method of R. Lathe, J. Mol. Biol ., 183 (1):1-12 (1985) and synthesized on an automated DNA synthesizer. One probe 
consists of a relatively long (32 nucleotides) "guessmer" [See J. J. Toole et al, Nature , 312:342-347 (1984)] of the 
following nucleotide sequence: 

35 

TCCTCATCCAGGGCAATGTCGCCCAGGAAGGC 

[ 0047 ] Because the genetic code is degenerate (more than one codon can code for the same amino acid), the number 
to of oligonucleotides in a probe pool is reduced based on the frequency of codon usage in eukaryotes, the relative 
stability of G:T base pairs, and the relative infrequency of the dinucleotide CpG in eukaryotic coding sequences [see 
. Toole et al., supra.]. The second set of probes consists of shorter oligonucleotides (17 nucleotides in length) which 
contain all possible sequences that could encode the amino acids. The second set of probes has the following se- 
quences: 

45 

(a) A [A/G] [A/G] TC [T/C] TC [T/C] TC [A/G] TC [T/C] AA 

(b) A [A/G] [A/G] TC [T/C] TC [T/C] TC [A/G] TCNAG 

Bracketed nucleotides are alternatives. M N K means either A, T, C or G. 
so [0048] In both cases the regions of the amino acid sequence used for probe design are chosen by avoiding highly 

degenerate codons where possible. The oligonucleotides are synthesized on an automated DNA synthesizer; the 
probes are then radioactively labeled with polynucleotide kinase and 32 P-ATP. 

[ 0049 ] These two sets of probes are used to screen a bovine genomic recombinant library. The library is constructed 
as follows: Bovine liver DNA is partially digested with the restriction endonuclease enzyme Sau 3A and sedimented 
55 through a sucrose gradient. Size fractionated DNA in the range of 15-30kb is then ligated to the bacteriophage Bam 

HI vector EMBL3 [Frischauf et al, J. Mol. Biol. , 170:827-842 (1983)]. The library is plated at 8000 recombinants per 
plate. Duplicate nitrocellulose replicas of the plaques are made and amplified according to a modification of the pro- 
cedure of Woo et al, Proc. Natl. Acad. Sci. USA , 75:3688-91 (1978). 
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[0050] The 32 mer probe is kinased with 32 P-gamma*ATP and hybridized to one set of filters in 5X SSC, 0.1% SDS, 
5X Denhardts, lOOug/ml salmon sperm DNA at 45 degrees C and washed with 5X SSC, 0.1% SDS at 45 degrees C. 
The 1 7 mer probes are kinased and hybridized to the other set of filters in 3M tetramethyiammonium chloride (TMAC), 
0.1 M sodium phosphate pH6.5, ImM EDTA, 5X Denhardts, 0.6% SDS, lOOug/ml salmon sperm DNA at 48 degrees 
5 C, and washed in 3M TMAC, 50mM Tris pH8.0 at 50 degrees C. These conditions minimize the detection of mismatches 
to the 17 mer probe pool [see, Wood et al, Proc. Natl. Acad. Sci, U.S.A.. 82:1585-1 588 (1985)]; 400,000 recombinants 
are screened by this procedure and one duplicate positive is plaque purified. DNA is isolated from a plate lysate of this 
recombinant bacteriophage designated lambda bP-50. bP-50 was deposited December 16, 1986 with the American 
Type Culture Collection, 12301 Parklawn Drive, Rockville, Maryland USA (hereinafter the "ATCC") under accession 
i° number 40295. This deposit as well as the other deposits contained herein meets the requirements of the Budapest 
Treaty on the International Recognition of the Deposit of Microorganisms for the Purposes of Patent Procedure and 
Regulations thereunder. This bp-50 clone encodes at least a portion of the bovine bone growth factor designated 
bBMP-1 . 

[0051] The oligonucleotide hybridizing region of this bBMP-1 clone is localized to an approximately 800bp Eco Rl 
15 fragment which is subcloned into Ml 3 and sequenced by standard techniques. The partial DNA sequence and derived 
amino acid sequence of lambda bP-50 are shown below in Table II. The amino acid sequences corresponding to the 
tryptic fragments isolated from the bovine bone 28 to 30kd material are underlined in Table II. The first underlined 
portion of the sequence corresponds to tryptic Fragment 1 above from which the oligonucleotide probes are designed. 
The second underlined portion corresponds to tryptic Fragment 2 above. The predicted amino acid sequence indicates 
20 that tryptic Fragment 2 is preceded by a basic residue (R) as expected considering the specificity of trypsin. The nucleic 
acid sequence preceding the couplet CT at nucleotide positions #292-293 in Table II is presumed to be an intron 
(noncoding sequence) based on the presence of a consensus acceptor sequence (i.e., a pyrimidine rich tract, 
TCTCTCTCC, followed by AG) and the lack of a basic residue in the appropriate position of the derived amino acid 
sequence. This bBMP-1 genomic sequence appears in Table II. The presumptive bBMP-1 peptide sequence from this 
25 genomic clone is 37 amino acids in length and is encoded by the DNA sequence from nucleotide #294 through #404 
in Table II. 



30 



35 



TABLE II 

280 . 290 ' (1) 303 323 

. CCTIGCXTCT TCTCTCTCCA GOT GOC TIC CIT GGG GAC ATC GCC CIG GAC GAG GAG 

Ala Rie Leu Glv Asp lie Ala Leu Asp Glu C-Iu 

338 353 ' 368 

GAC TIG AGG GCC TIC CAA GTG GAG CAG GOT GOG GAC CTC AGA CAG OCT GCA ACC 
Asp Leu Arg Ala Fhe Gin Val Gin Gin Ala Ala Asp Leu Arg Gin Arg Ala Thr 



40 . 



383 398 (37) 414 424 
OGC AGG TCT TCC ATC AAA GCT GCA GCTACACIGG CTACAGGCCA 
Arg Arg Ser Ser lie Lys Ala Ala 



45 

B. bBMP-2 

[0052] Two probes consisting of pools of oligonucleotides are designed on the basis of the amino acid sequence of 
Fragment 3 and synthesized on an automated DNA synthesizer as described above. 

50 

Probe #1: ACNACCAT [A/G] T C [T/C] T G [A/G] A T 
Probe #2: C A [A/G] G A [T/C] ATGGTNGTNGA 

These probes are radioactively labeled and employed to screen the bovine genomic library constructed as described 
55 in part A except that the vector is lambda J1 Bam HI arms [Mullins et al Nature 308 : 856-858 (1 984).] The radioactively 
labelled 17-mer Probe #1 is hybridized to the set of filters according to the method for the 17 mer probe described in 
part A. 

[0053] 400,000 recombinants are screened by the procedure described above in Part A. One duplicate positive is 
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plaque purified and the DNA is isolated from a plate lysate of the recombinant bacteriophage designated lambda bP- 
21 . Bacteriophage bP-21 was deposited with the ATCC under accession number ATCC 4031 0 on March 6, 1 987. The 
bP-21 clone encodes the bovine growth factor designated bBMP-2. 

[0054] The oligonucleotide hybridizing region of this bBMP-2 clone is localized to an approximately 1.2 kb Sac I 
restriction fragment which is subcloned into Ml 3 and sequenced by standard techniques. The partial DNA sequence 
and derived amino acid sequence of this Sac I fragment and the contiguous Hind Ill-Sac I restriction fragment of bP- 
21 are shown below in Table III. The bBMP-2 peptide sequence from this clone is 129 amino acids in length and is 
encoded by the DNA sequence from nucleotide #1 through nucleotide #387. The amino acid sequence corresponding 
to the tryptic fragment isolated from the bovine bone 28 to 30kd material is underlined in Table III. The underlined 
portion of the sequence corresponds to tryptic Fragment 3 above from which the oligonucleotide probes for bBMP-2 
are designed. The predicted amino acid sequence indicates that tryptic Fragment 3 is preceded by a basic residue (K) 
as expected considering the specificity of trypsin. The arginine residue encoded by the CGT triplet is presumed to be 
the carboxy-terminus of the protein based on the presence of a stop codon (TAG) adjacent to it. 
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TABLE III 



5 


(1) 

• GGC 


CAC 


GAT 


GGG 


15 

AAA 


GGA 


CAC 


CCT 


CTC 


30 

CAC 


AGA 


AGA 


GAA 


AAG 


45 

CGG 




G 


H 


D . 


G 


K 


G 


H 


P 


L 


H 


R 


R 


E 


K 


R 


10 


CAA 


GCA 


AAA 


CAC 


60 

AAA 


CAG 


CGG 


AAA 


CGC 


75 

CTC 


AAG 


TCC 


AGC 


TGT 


90 

AAG 


Q 


A 


K 


H 


K 


Q 


R 


K 


R 


L 


K 


S 


S 


C 


K 




AGA 


CAC 


CCT 


TTA 


105- 

TAT 


GTG 


GAC 


TTC 


AGT 


12 0 
GAT 


GTG 


GGG 


TGG 


AAT 


135 

GAC 


15 


R 


H 


P 


L. 


•Y 


V 


D 


F 


S 


D 


V 


G 


W 


N 


D 




. TGG 


ATC 


GTT 


GCA 


150 

CCG 


CCG 


GGG 


TAT 


CAT 


165 

GCC 


TTT 


TAC 


TGC 


CAT 


180 

GGG 




W 


I 


V 


A 


P 


P 


G** 


Y 


H 


A 


F 


Y 


C 


H 


G 


20 


GAG 


TGC 


CCT 


TTT 


195 

CCC 


CTG 


GCC 


GAT 


CAC 


210 

CTT 


AAC 


TCC 


ACG 


AAT 


225 

CAT 




E 1 


C 


P 


F 


p 


L 


A 


D 


H 


L 


N 


S 


T 


N 


H 


25 


GCC 


ATT 


CTC 


CAA 


240 

ACT 


CTG 


GTC 


AAC 


TCA 


255 

GTT 


AAC 


TCT 


AAG 


ATT 


270 

CCC 




A 


I 


V 


Q 


T 


L 


V 


N 


S 


V 


N 


S 


K 


I 


P 


30 


AAG 


GCA 


TGC 


TGT 


385 

GTC 


CCA 


ACA 


GAG 


CTC 


300 

AGC 


GCC 


ATC 


TCC 


ATG 


315 

CTG 


K 


A 


C 


C 


V 


P 


T 


E 


L 


S 


A 


I 


S 


M 


L 




TAC 


CTT 


GAT 


GAG 


330 

AAT 


GAG 


AAG 


GTG 


GTA 


345 

TTA 


AAG 


AAC 


TAT 


CAG 


360 

GAC 


35 


Y 


L 


D 


E 


N 


E 


K 


V 


V 


L 


K 


N 


Y 


0 


D 


ATG 


GTT 


GTC 


GAG 


375 

GGT 


TGT 


GGG 


TGT 


(129) 397 407 

CGT TAGCACAGCA AAATAAAATA 




M 


V 


V 


E 




C 


G 


C 


R 















40 417 427 437 -447 457 

TAAATATATA TATATATATA TTAGAAAAAC AGCAAAAAAA TCAA'GTTGAC 

467 477 487 497 507 

ACTTTAATAT TTCCCAATGA AGACTTTATT TATGGAATGG AATGGAGAAA 

45 

517 527 537 547 557 

AAGAAAAACA CAGCTATTTT GAAAACTATA TTTATATCTA CCGAAAAGAA 

567 577 587 

so GTTGGGAAAA CAAATATTTT AATCAGAGAA TTATT 



55 
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C. bBMP-3 

[0055] Probes consisting of pools of oligonucleotides are designed on the basis of the amino acid sequences of the 
tryptic Fragments 9 (Probe #3), 10 (Probe #2), and 11 (Probe #1), and synthesized on an automated DNA synthesizer. 

5 

Probe #1: ACNGTCAT [A/G] T T N G G [A/G] T A 

Probe #2: C A [A/G] T A [A/G] T A N G C [A/G] T C [A/G] A A 

io Probe #3: T G [A/G/T] ATNGTNGC [A/G] T G [A/G] T T 

[0056] A recombinant bovine genomic library constructed in EM BL3 is screened by the TM AC hybridization procedure 
detailed above in part A. 400,000 recombinants are screened in duplicate with Probe #1 which has been labeled 
with 32 P. All recombinants which hybridized to this probe are replated for secondaries. Triplicate nitrocellulose replicas 
15 are made of the secondary plates, and amplified as described. The three sets of filters are hybridized to Probes #1 , 
#2 and #3, again under TMAC conditions. One clone, lambda bP-819, hybridizes to all three probes and is plaque 
purified and DNA is isolated from a plate lysate. Bacteriophage lambda bP-819 was deposited with the ATCC on June 
1 6, 1 987 under accession number 40344. This bP-819 clone encodes the bovine bone growth factor designated bBMP- 
3. 

20 [0057] The region of bP-819 which hybridizes to Probe #2 is localized and sequenced. The partial DNA and derived 

amino acid sequences of this region are shown in Table IVA. The amino acid sequences corresponding to tryptic 
Fragments 10 and 12 are underlined. The first underlined sequence corresponds to Fragment 12 while the second 
corresponds to Fragment 10. This region of bP-819, therefore, which hybridizes to Probe #2 encodes at least 111 
amino acids. This amino acid sequence is encoded by the DNA sequence from nucleotide #414 through #746. 

25 * * 



30 



35 



40 



45 




50 



55 
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TABLE IV. A. 



383 393 403 413 ( 1 ) 428 

' GAGGAGGAAG CEGTCEAOGG GGGTCC I TCT GCCTCTC-CAG AAC AAT GAG CTT CCT GC-G C-CA 

Asn .Asn Glu Leu Pro Gly Ala 

443 458 473 488 

GAA TAT CAG TAC AAG GAG GAT GAA GIA TGG GAG GAG AGG AAG CCT TAC AAG ACT 

Glu Tyr Gin Tyr Lys Glu Asp Glu Val Trp Glu Glu Arg Lys Pro Tyr Lys Thr 

503 518 533 

CTT CAG ACT CAG CCC CCT GAT AAG AGT AAG AAC AAA AAG AAA CAG AGG AAG GGA 

Leu Gin Thr Gin Pro Pro Asp Lys Ser Lys Asn lys Lys Lys Gin Arg Lys Gly 

548 -- 5.63 578 593 

CCT CAG CAG AAG ACT CAG ACG CTC CAG TIT GAT GAA CAG ACC CTG AAG AAG GCA 

Pro Gin Gin Lys Ser Gin Bir Leu Gin Hie Asp Glu Gin Thr Leu Lys Lys Ala 

608 623 638 

AGA AGA AAG CAA TGG AIT GAA CCC CGG AAT TCT GOC AGA CGG TAC CTT AAA CTG 

Arg Arg Lys Gin Trp lie Glu Pro Arg Asn cys Ala Arg Arg Tyr Leu Lys Val 

653 668 683 698 

GAC TTC GCA GAT ATT GGC TOG AGO GAA TGG AIT AIT TCC CCC AAG TCC ITC CAT 

Asp Phe Ala Asp lie Gly Trp Ser Glu Trp lie lie Ser Pro Lys Ser Hie Asp 

713 728 743 ( 111 ) 756 

GCC TAT TAC TGC TCC GGA GOG ICC CAG ITC CCC ATG CCA AAG CTAGCCATIG 

Ala Tyr Tyr Cys Ser Glv Ala Cys Gin Phe Pro MET Pro Lys 

766 776 786 

TITITICTCC TGTOCITCCC AITICCATAG 



to [0058] The region of bP-819 which hybridizes to Probe #1 and #3 is localized and sequenced. The partial DNA and 
derived amino acid sequences of this region are shown in Table IVB. The amino acid sequences corresponding to 
tryptic Fragments 9 and 11 are underlined. The first underlined sequence corresponds to Fragment 9 while the second 
underlined sequence corresponds to Fragment 11 . The peptide sequence of this region of bP-819 which hybridizes to 
Probe #1 and #3 is 64 amino acids in length encoded by nucleotide #305 through #493 of Table IVB. The arginine 
45 residue encoded by the AGA triplet is presumed to be the carboxy-terminus of the protein based on the presence of 
a stop codon (TAA) adjacent to it. The nucleic acid sequence preceding the couplet TC (positions 305-306) is presumed 
to be an intron (non-coding sequence) based on the presence of a consensus acceptor sequence (i.e. a pyrimidine- 
rich stretch, TTCTCCC i 1 1 i CGTTCCT, followed by AG) and the presence of a stop rather than a basic residue in the 
appropriate position of the derived amino acid sequence. 

so [0059] bBMP-3 is therefore characterized by the DNA and amino acid sequence of Table IV A and Table IV B. The 
peptide sequence of this clone is 175 amino acids in length and is encoded by the DNA sequence from nucleotide 
#414 through nucleotide #746 ot Table IV A and nucleotide #305 through nucleotide #493 of Table IV B. 



55 
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TABLE IV. B. 



5 284 294 304 ( 112 ) 319 

CTAACCIGIG TICTCCCITT TCCTTCCEAG TCT TIG MG CCA TCA AAT CAC GOT ACC 

Ser Leu Lvs Pro Ser Asn His Ala Thr 

334 349 364 379 

io ATC CAG ACT ATA GIG AGA GOT GIG GGG GIG GTC CCT GGA ATC CCC GAG CCT TGC 
lie Gin Ser lie Val Arg Ala Val Gly Val Val Pro Gly lie Pro Glu Pro Cys 

394 409 424 439 

TGI GIG CCA GAA AAG ATC TCC TCA CIC AGO ATC TTA TTC TIT GAT GAA AAC AAG 

is cys Val Pro Glu Lys MET Ser Ser leu Ser.. lie Leu Hie Phe Asp Glu Asn Lys 

454 .469 484 ( 175 ) 

AAT GIG CTA CTT AAA GIA TAT CCA AAC ATC ACA GIA GAG TCT TCT GCT TGC AGA 

Asn Val Val Leu Lys Val Tvr Pro Asn MET Thr Val Glu Ser Cys Ala Cys Arg 

20 

503 513 523 533 

TAACCIGGIG AAGAACTCAT CIGGATCCTT AACTCAATOG 



25 



EXAMPLE V 

30 Human Bone Inductive Factors 



A. hBMP-1 

[0060] Because the bovine and human bone growth factor genes are presumed to be significantly homologous, the 
35 bovine bBMP-1 DNA sequence of Table II (or portions thereof) is used as a probe to screen a human genomic library. 
The 800bp EcoRI fragment of the bovine genomic clone is labeled with 32 P by nick-translation. A human genomic 
library (Toole et al., supra) is plated on 20 plates at 40,000 recombinants per plate. Duplicate nitrocellulose filter replicas 
are made of each plate and hybridized to the nick-translated probe in 5 X SSC, 5 X Denhardt's, tOOug/ml denatured 
salmon sperm DNA, 0.1% SDS (the standard hybridization solution) at 50 degrees centigrade for approximately 14 
*0 hours. The filters are then washed in 1 X SSC, 0.1% SDS at 50 degrees centigrade and subjected to autoradiography. 
Five duplicate positives are isolated and plaque purified. DNA is obtained from a plate lysate of one of these recombinant 
bacteriophage, designated LP-HI. LP-HI was deposited with the ATCC on March 6, 1987 under accession number 
40311 . This clone encodes at least a portion of the human genomic bone growth factor called hBMP-1 . The hybridizing 
region of LP-HI is localized to a 2.5kb Xbal/Hindlll restriction fragment. 

45 [0061] The partial DNA sequence and derived amino acid sequence of lambda LP-HI are shown below in Table V. 

The peptide sequence from this clone is 37 amino acids in length and is encoded by the DNA sequence from nucleotide 
#3440 through nucleotide #3550. The coding sequence of Table V is flanked by approximately 28 nucleotides (a pre- 
sumptive 5’ noncoding sequence) as well as approximately 19 nucleotides (a presumptive 3' noncoding sequence. A 
comparison of the bBMP-1 sequence of Table II with the hBMP-1 genomic sequence of Table V indicates the significant 
50 homology between the two. 

[0062] Because the size of coding regions and the positions of noncoding regions is generally conserved in homol- 
ogous genes of different species, the locations of the coding and noncoding regions of the bone inductive factor genes 
may be identified. Regions of homology between the two species' genes, flanked by RNA processing signals at ho- 
mologous sites, indicate a coding region. 

55 
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TABLE V 



5 



10 



15 



3419 . 3429 3439 (1) 3454 

* CAGC CCIGGC TTCTTCITTT CICITIAGCr GCC TIT CTT GGG GAC AIT GCC CIG GAC 

Ala Hie Leu Gly Asp He Ala leu Asp 

3469 3484 3499 3514 

GAA GAG GAC CFG AGG GCC TIC CAG GTA CAG CAG GCT GIG GAT CTO AGA CGG CAC 

Glu Glu Asp Leu Arg Ala Hie Gin Val Gin Gin Ala Val Asp Leu Arg Arg His 

3529 3544 (37) 3560 3570 

. ACA GCT CGT AAG TCC TCC ATC AAA GCT GCA GGTAAGCCGG GIGCCAATGG 

T hr Ala Arg Lys Ser Ser lie Lys Ala Ala 



[0063] A probe specific for the human coding sequence given in Table V is used to identify a human cell line or tissue 
which synthesizes bone inductive factor. The probe is made according to the following method. Two oligonucleotides 
having the following sequences: 



(a) GGGAATTCTGCCTTTCTTGGGGACATTGCCCTGGACGAAGAGGACCTGAG 
25 (b) CGGGATCCGTCTGAGATCCACAGCCTGCTGTACCTGGAAGGCCCTCAGG 

are synthesized on an automated synthesizer, annealed, extended using the Klenow fragment of E. coli DNA polymer- 
ase I, digested with the restriction enzymes Eco Rl and Bam HI, and inserted into an Ml 3 vector. A single-stranded 32 P- 
labeled probe is then from template preparation of this subclone by standard techniques. Polyadenylated RNAs from 
30 various cell and tissue sources are electrophoresed on formaldehyde-agarose gels and transfered to nitrocellulose by 
the method of Toole et al., supra . The probe is then hybridized to the nitrocellulose blot in 50% formamide, 5 X SSC, 
0.1% SDS, 40 mM sodium phosphate pH 6.5, 100 ug/ml denatured salmon sperm DNA, and 5 mM vanadyl ribonucl- 
eosides at 42° C overnight and washed at 65° C in 0.2 X SSC, 0.1% SDS. Following autoradiography, the lane con- 
taining RNA from the human osteosarcoma cell line U-2 OS contains hybridizing bands corresponding to RNA species 
35 of approximately 4.3 and 3.0 kb. 

[ 0064 ] cDNA is synthesized from U-2 OS polyadenylated RNA and cloned into lambda gtl 0 by established techniques 
(Toole et al., supra ). 20,000 recombinants from this library are plated on each of 50 plates. Duplicate nitrocellulose 
replicas are made of the plates. The above described oligonucleotides are kinased with 32 P-gamma-ATP and hybridized 
to the two sets of replicas at 55° centigrade in standard hybridization solution overnight. The filters are then washed 
40 jn 1 X SSC, 0.1% SDS at 55° centigrade and subjected to autoradiography. One duplicate positive, designated lambda 
U20S-1 is plaque purified. Lambda U20S-1 was deposited with the ATCC on June 16, 1987 under accession number 
40343. 

[ 0065 ] The entire nucleotide sequence and derived amino acid sequence of the insert of lambda U20S-1 is given 
in Table VI. This cDNA clone encodes a Met followed by .a hydrophobic leader sequence characteristic of a secreted 
45 protein, and contains a stop codon at nucleotide positons 2226 - 2228. This clone contains an open reading frame of 
2190bp, encoding a protein of 730 amino acids with a molecular weight og 83kd based on this amino acid sequence. 
The clone contains sequence identical to the coding region given in Table V. This protein is contemplated to represent 
a primary translation product which is cleaved upon secretion to produce the hBMP-1 protein. This clone is therefore 
a cDNA for hBMP-1 corresponding to human gene fragment contained in the genomic hBMP-1 sequence lambda 
50 LP-Hi. It is noted that amino acids #550 to #590 of BMP-1 are homologous to epidermal growth factor and the "growth 
factor" domains of Protein C, Factor X and Factor IX. 
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TABLE VI 

10 20 30 (1) 50 

• CTAGAGGCQG CITCCCTCGC CC-CCGCOCOC- OCAGC ATC CCC C-C-C GTC GCC 030 CTG CCG 

MET Fro Gly Val Ala Arg Leu Pro 

65 80 95 . 110 

CTG CLG CIC GGG CTG CTG CIG CTC CCG CCT CCC C-C-C CC-G COG CTG GAC TTG GCC 

Leu Leu leu Gly Leu. leu Leu Leu Pro Arg Pro Gly Arg Pro Leu Asp Leu Ala 

125 140 155 

GAC TAC ACC TAT GAC CIG GOG GAG GAG GAC GAC TOG GAG CCC CTC AAC TAC AAA 

Asp Tyr Thr Tyr Asp Leu Ala Glu Glu Asp Asp Ser Glu Pro Leu Asn iyr Lys 

170 .. 185 200 *•’ 215 

GAC CCC TGC AAG GOG GOT GCC TIT CIT GGG GAC ATT GCC CTG GAC GAA GAG GAC 

Asp Pro Cys lys Ala Ala Ala Hie- Leu Gly Asp lie Ala Leu Asp Glu Glu Asp 

230 245 260 275 

CTG AGG GCC TTC CAG GTA CAG CAG GCT GIG GAT CTC AGA CGG CAC ACA GCT OGT 

leu Arg Ala Hie Gin Val Gin Gin Ala Val Asp Leu Arg Arg His Hut Ala Arg 

290 305 320 

AAG TCC TCC ATC AAA GCT GCA GIT CCA- GGA AAC ACT TOT ACC CCC AGC TGC CAG 

Lys Ser Ser He Lys Ala Ala Val Pro Gly Asn Thr Ser Thr Pro Ser Cys Gin 

335 350 365 380 

AGC ACC AAC GGG CAG CCT CAG AGG GGA GCC TGT GC-G. AGA TC-G AGA C-GT AGA TCC 

Ser Thr Asn Gly Gin Pro Gin Arg Gly Ala Cys Gly Arg Trp Arg Gly Arg Ser 

395 410 425 

CCT AGC CC-G CGG GOG GOG AOG TCC OGA CCA GAG CCT GIG TC-G CCC GAT GGG GIC 

Arg Ser Arg Arg Ala Ala Thr Ser Arg Pro Glu Arg Val Trp Pro Asp Gly Val 

440 455 470 485 

ATC COC TIT GTC AIT C-C-G GGA AAC TTC ACT GGT AGC CAG AGG GCA GIC TTC CGG 

lie Pro The Val He Gly Gly Asn Fhe Thr Gly Ser Gin Arg Ala Val Fhe Arg 

500 515 530 545 

CAG GCC A1G AGG CAC TCG GAG AAG CAC ACC TGT GTC ACC TTC CTG GAG CGC ACT 

Gin Ala MET Arg His Trp Glu lys His Thr Cys Val Thr Fhe Leu Glu Arg Thr 

560 575 590 

GAC GAG GAC AGC TAT AIT GTC TTC ACC TAT OGA CCT TGC GGG TCC TGC TCC TAC 

Asp Glu Asp Ser Tyr lie Val Fhe Thr Tyr Arg Pro Cys Gly Cys Cys Ser Tyr 

605 620 635 

GIG GGT CGC OGC GGC GGG C-C-C OCC CAG GOC ATC TCC ATC 

Val Gly Arg Arg Gly Gly Gly Pro Gin Ala He Ser lie 



650 

GGC AAG AAC TCT GAC 
Gly Lys Asn Cys Asp 
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665 680 695 

AAG TIC GGC AIT GTG GTC CAC GAG CIG GGC CAC GTC GTC GGC TTC TGG CAC GAA 

Lys Phe Gly He Val Val His Glu Leu Gly His Val Val Gly Fhe Trp His Glu 

710 725 740 755 

CAC ACT OC-G CCA GAC OSG GAC CGC CAC GIT TCC ATC GTT CGT GAG AAC ATC CAG 

His Thr Arg Pro Asp Arg Asp Arg His Val Ser lie Val Arg Glu Asn lie Gin 

770 785 800 815 

CCA GGG CAG GAG TAT AAC TTC CIG AAG ATS GAG OCT CAG GAG GTG GAG TCC CIG 

Pro Gly Gin Glu Tyr Asn Fhe. leu Lys MET Glu Pro Gin Glu Val Glu Ser leu 

830 845 860 

GGG GAG ACC TAT GAC TIC GAC AGO ATC ATS CAT TAC GCT OSG AAC ACA TTC TCC 

Gly Glu Ihr Tyr Asp Fhe Asp Ser He MET His Tyr Ala Arg Asn Thr Fhe Ser 

875 890 905 920 

AGG GGC ATC TTC CIG GAT ACC AIT GTC CCC AAG TAT GAG GIG AAC GGG GIG AAA 

Arg Gly He Fhe Leu Asp Thr He Val Pro Lys Tyr Glu Val Asn Gly Val Lys 

935 950 965 

OCT CCC AIT GGC CAA AGG ACA OSG CTC AGC AAG GGG GAC AIT GCC CAA GCC CGC 

Pro Pro He Gly Gin Arg Thr Arg Leu Ser Lys Gly Asp lie Ala Gin Ala Arg 

980 995 1010 1025 

AAG CIT TAC AAG TGC OCA GCC TCT GGA GAG ACC CIG CAA GAC AGC ACA GGC AAC 

Lys leu Tyr Lys Cys Pro Ala Cys. Gly Glu Thr Leu Gin Asp Ser Thr Gly Asn 

1040 1055 1070 1085 

TTC TCC TOC CCT GAA TAC CCC AAT GGC TAC TCT GCT CAC ATG CAC TCC GIG TCG 

Fhe Ser Ser Pro Glu Tyr Pro Asn Gly Tyr Ser Ala His MET His cys Val Trp 

1100 H15 H30 

OGC ATC TCT GTC ACA CCC GGG GAG AAG ATC ATC CIG AAC TIC ACG ICC CIG GAC 

Arg lie Ser Val Thr Pro Gly Glu Lys lie lie Leu Asn Phe Thr Ser Leu Asp 

H45 1160 1175 1190 

CIG TAC CGC AGC CGC CIG TCC TGG TAC GAC TAT GIG GAG GIG CGA GAT GGC TIC 

leu iyr Arg Ser Arg Leu Cys Trp Tyr Asp iyr Val Glu Val Arg Asp Gly Fhe 

1205 1220 1235 

.TCG AGG AAG GOG CCC CTC CGA GGC CGC TIC TGC GGG TOC AAA CIC CCT GAG CCT 

Trp Arg Lys Ala Pro Leu Arg Gly Arg Fhe Cys Gly Ser Lys Leu Pro Glu Pro 

1250 1265 1280 1295 

ATC GTC TCC ACT GAC AGC OSC CTC TCG GTT GAA TTC CGC AGC AGC AGC AAT TGG 

lie Val Ser Thr Asp Ser Arg Leu Trp Val Glu Phe Arg Ser Ser Ser Asn Trp 

1310 1325 1340 1355 

GIT GGA AAG GGC TIC TIT GCA GTC TAC GAA GCC ATC TGC GGG GGT GAT GIG AAA 

Val Gly Lys Gly Fhe Fhe Ala Val iyr Glu Ala lie Cys Gly Gly Asp Val Lys 



55 



17 




EP 1 254 956 A2 



5 



10 




20 



25 



30 



35 



40 



45 



50 



1370 1385 1400 

PA G GAC TAT GGC CAC AIT CAA TOG CCC AAC TAC CCA GAC GAT TAC OGG CCC AGC 
Lys Asp Tyr Gly His lie Gin Ser Pro Asn Tyr Pro Asp Asp Tyr Arg Pro Ser 

1415 1430 1445 1460 ' 

AAA GTC TGC ATC TGG OGG ATC CAG GIG TCP GAG GGC TIC CAC GIG GGC CTC ACA 
Lys Val Cys He Trp Arg He Gin Val Ser Glu Gly Phe His Val Gly Leu Thr 

i475 1490 1505 

TTC CAG ICC TIT GAG AIT GAG CGC CAC GAC AGC TGT GCC TAC GAC TAT CTG GAG 
Phe Gin Ser Phe Glu lie Glu Arg His Asp Ser Cys Ala Tyr Asp Tyr Leu Glu 

1520 1535 1550 1565 

GIG OGC GAC GGG CAC AGT GAG AGC AGC ACC CTC AIC GGG OGC TAC TGT GGC TAT 

Val Arg Asp Gly His Ser Glu Ser Ser Hur leu He Gly Arg Tyr Cys Gly Tyr 

1580 1595' 1610 1625 

GAG AAG CCT GAT GAC. ATC AAG AGC AOS TCC AGC CGC CTC TGG CTC AAG TIC CTC 
Glu lys Pro Asp Asp He Lys Ser Thr Ser Ser Arg leu Trp Leu Lys Fhe Val 

1640 1655 * 1670 

TCT GAC GGG TCC ATT AAC AAA GOG GGC TIT GCC CTC AAC TIT TTC AAA GAG CTG 
Ser Asp Gly Ser He Asn lys Ala Gly Fhe Ala Val Asn Phe Phe Lys Glu Val 

1685 1700 1715 1730 

GAC GAG TGC TCT OGG CCC AAC CGC GGG GGC TCT GAG CAG OGG TGC CIC AAC ACC 
Asp Glu Cys Ser Arg Pro Asn Arg Gly Gly Cys Glu Gin Arg Cys Leu Asn Thr 

1745 1760 1775 

CIG GGC AGC TAC AAG TGC AGC TGT GAC CCC GGG TAC GAG CIG GCC CCA GAC AAG 
leu Gly Ser Tyr Lys Cys Ser Cys Aisp Pro Gly IVr Glu Leu Ada Pro Asp lys 

1790 1805 1820 1835 

OGC OGC TCT GAG GCT GCT TCT GGC GGA TTC CTC ACC AAG CTC AAC C-GC TCC ATC 

Arg Arg Cys Glu Ala Ada Cys Gly Gly Fhe leu Thr Lys Leu Asn Gly Ser He 

1850 1865 1880 1895 

ACC AGC OOG GGC TGG CCC AAG GAG TAC OCC CCC AAC AAG PAC TGC ATC TGG CAG 
Thr Ser Pro Gly Trp Pro lys Glu Tyr Pro Pro Asn lys ASn Cys lie Trp Gin 

1910 1925 1940 

CIG GIG GCC CCC ACC CAG TAC CGC ATC TOC CIG CAG TIT GAC TTC TIT GAG ACA 
Leu- Val Ala Pro Thr Gin Tyr Arg lie Ser Leu C-ln Phe Asp Fhe Fhe Glu Thr 

1955 1970 1985 2000 

GAG GGC AAT GAT CTG TGC AAG TAC GAC TTC CTG GAG CTG CGC ACT GGA CTC ACA 
Glu Gly Asn Asp Val Cys Lys Tyr Asp Fhe Val Glu Val Arg Ser Gly Leu Thr 

2015 - 2030 2045 

GCT GAC TCC AAG CIG CAT GGC AAG TTC TGT GCT TCT GAG AAG CCC GAG CTC ATC 
Ala Asp Ser Lys leu His Gly Lys Phe Cys Gly Ser Glu Lys Pro Glu Val lie 
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2060 2075 2090 2105 

ACC TCC CAG TAC AAC AAC ATC CGC GIG GAG TTC AAG TCC GAC AAC ACC GIG TCC 

Thr Ser Gin Tyr Asn Asn MET Arg Val Glu Phe Lys Ser Asp Asn Thr Val Ser 

5 

2120 2135 2150 2165 

AAA AAG GGC TIC AAG GCC CAC TTC TTC TCA GAA AAG AGG CCA GOT CIG CAG CCC 

lys Lys Gly Ffre Lys Ala Kis Hie Fhe Ser Glu Lys Axg Pro Ala Leu Gin Pro 

2180 -2195 2210 

CCT OGG GGA CGC CCC CAC CAG CIC AAA TTC OGLA GIG CAG AAA AGA AAC OGG ACC 

Pro Arg Gly Arg Pro His Gin Leu Lys Hie Arg Val Gin Lys Arg Asn Arg Thr 

(730) 

2225 2235 2245 2255 2265 2275 2285 

CCC CAG TGAGGCCIGC CAGGCCICCC GGACCOCITG TTACICAGGA ACCTCACCIT GGAOGGAATG 
15 Pro Gin 

2295 2305 2315 2325 2335 2345 2355 

GGATGGGGGC TTOGGIGCCC ACCAACCCCC CACCICCACT CIGCCATICC GGCCCACCIC CCICIGGCCG 



2365 2375 2385, 2395 2405 2415 2425 

GACAGAACIG GTGCICICIT CICCCCACIG TGCCOGTCOG CGCACOGGGG ACCCITCCCC GIGCCCIACC 



2435 2445 2455 2465 2475 2485 2495 

CCCICCCATT TTGATGGIGT CIGIGACATT TCCTGITGIG AAGTAAAAGA GGGACCCCIG CGICCIGCCT 



30 



CIAGA 



B. hBMP-2: Class I and II 

35 [0066] The Hindlll-Sacl bovine genomic bBMP-2 fragment described in Example IV B. is subcloned into an Ml 3 

vector. A 32 P-labeled single-stranded DNA probe is made from a template preparation of this subclone. This probe is 
used to screen polyadenylated RNAs from various cell and tissue sources as described above in part A. A hybridizing 
band corresponding to an mRNA species of approximately 3.8 kb is detected in the lane containing RNA from the 
human cell line U-2 OS. The Hindlll-Sacl fragment is labeled with 32 P by nick translation and used to screen the nitro- 

40 cellulose filter replicas of the above-described U-2 OS cDNA library by hybridization in standard hybridization buffer 
at 65° overnight followed by washing in 1 X SSC, 0.1% SDS at 65°. Twelve duplicate positive clones are picked and 
replated for secondaries. Duplicate nitrocellulose replicas are made of the secondary plates and both sets hybridized 
to the bovine genomic probe as the primary screening was performed. One set of filters is then washed in 1 X SSC, 
0.1% SDS; the other in 0.1 X SSC, 0.1% SDS at 65°. 

45 [0067] Two classes of hBMP-2 cDNA clones are evident based on strong (4 recombinants) or weak (7 recombinants) 

hybridization signals under the more stringent washing conditions (0.1 X SSC, 0.1% SDS). All 11 recombinant bacte- 
riophage are plaque purified, small scale DNA preparations made from plate lysates of each, and the inserts subcloned 
into pSP65 and into Ml 3 for sequence analysis. Sequence analysis of the strongly hybridizing clones designated 
hBMP-2 Class I (also known as BMP-2) indicates that they have extensive sequence homology with the sequence 

50 given in Table III. These clones are therefore cDNA encoding the human equivalent of the protein encoded by the 
bBMP-2 gene whose partial sequence is given in Table III. Sequence analysis of the weakly hybridizing recombinants 
designated hBMP-2 Class II (also known as BMP-4) indicates that they are also quite homologous with the sequence 
given in Table III at the 3' end of their coding regions, but less so in the more 5' regions. Thus they encode a human 
protein of similar, though not identical, structure to that above. 

55 [0068] Full length hBMP-2 Class I cDNA clones are obtained in the following manner. The 1 .5 kb insert of one of the 

Class II subclones (11-1 0-1 ) is isolated and radioactively labeled by nick-translation. One set of the nitrocellulose replicas 
of the U-2 OS cDNA library screened above (50 filters, corresponding to 1,000,000 recombinant bacteriophage) are 
rehybridized with this probe under stringent conditions (hybridization at 65° in standard hybridization buffer; washing 
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at 65° in 0.2 X SSC, 0. 1 % SDS). All recombinants which hybridize to the bovine genomic probe which do not hybridize 
to the Class II probe are picked and plaque purified (10 recombinants). Plate stocks are made and small scale bacte- 
riophage DNA preparations made. After subcloning into Ml 3, sequence analysis indicates that 4 of these represent 
clones which overlap the original Class I clone. One of these, lambda U20S-39, contains an approximately 1 .5 kb 
insert and was deposited with the ATCC on June 1 6, 1 987 under accession number 40345. The partial DNA sequence 
(compiled from lambda U20S-39 and several other hBMP-2 Class I cDNA recombinants) and derived amino acid se- 
quence are shown below in Table VII. Lambda U20S-39 is expected to contain all of the nucleotide sequence necessary 
to encode the entire human counterpart of the protein BMP-2 Class I encoded by the bovine gene segment whose 
partial sequence is presented in Table III. This human cDNA hBMP-2 Class I. contains an open reading frame of 1188 
bp, encoding a protein of 396 amino acids. This protein of 396 amino acids has a molecular weight of 45kd based on 
this amino acid sequence. It is contemplated that this sequence represents the primary translation product. The protein 
is preceded by a 5' untranslated region of 342 bp with stop codons in all frames. The 13 bp region preceding this 5' 
untranslated region represents a linker used in the cDNA cloning procedure. 
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10 20 30 40 50 60 70 

GTOGACTCIA GACTGTICTGT CAGCACTIGG CIC-C-GGACIT CITGAACITG CAGGGAGAAT AACTIGOGCA 



80 90 100 110 120 130 140 

CCCCACITTG CGCCGGIGCC TTIGCCCCAG OGGAGCCIC-C TIOGCCATCT CCGAGOCOCA C 0 GCCCCICC 



150 160 170 180 190 200 210 

ACTCCIOGGC CTIGCCOGAC ACIGAGAOGC TGTTCCCAGG GTGAAAAGAG AGACIGOGOG GCCGGCACCC 



220 230 240 250 260 270 280 

GGGAGAAGGA GGAGGCAAAG AAAAGGAACG GACATIOGGT CCITGCGCCA GGTCCITIGA CCAGAGT TTT 



290 _ 300 310 320 330 340 350 

TCCATCIGGA OSCTCITTCA ATC-GAOSIGT CCCOGOGTGC TTCITAGAGG GACIGOGGTC TCCIAAAGCT 



25 (!) 370 385 400 

OGACC ATG CTG C-CC GGG ACC CGC TGT CIT CIA GOG TIG CIG CIT CCC CAG GTC 

MET Val Ala Gly Dir Arg Cys Leu Leu Ala Leu Leu Leu Pro Gin Val 

415 430 445 

CTC CIG GGC GGC GOG GCT GGC CIO GIT COG GAG CIG GGC CGC AGG AAG TTC GGG 
30 leu leu Gly Gly Ala Ala Gly Leu Val Pro C-lu Leu Gly Arg Arg Lys Rie Ala 

460 475 490 505 

C-OG GOG TOG TOG GGC CGC CCC TCA TOC CAG CCC TCT GAC GAG GTC CIG AGC GAG 

Ala Ala Ser Ser Gly Arg Pro Ser Ser Gin Pro Ser Asp Glu Val Leu Ser Glu 

35 

520 535 550 - 565 

TTC GAG TIG OGG CIG CTC AGC ATG TTC GGC CIG AAA CAG AGA CCC ACC CCC AGC 

Phe Glu Leu Arg Leu Leu Ser MET Hie Gly Leu Lys Gin Arg Pro Thr Pro Ser 

580 595 610 

AGG GAC GOC CTG CTG CCC CCC TAC ATG CTA GAC CIG TAT CGC AGG CAC TOG GGT 

Arg Asp Ala Val Val Pro Pro Tyr MET leu Asp Leu Tyr Arg Arg His Ser Gly 

625 640 655 670 

CAG COS GGC TCA CCC GCC CCA GAC CAC OGG TIG GAG AGG GCA GCC AGC OGA C-CC 
45 Gin Pro Gly Ser Pro Ala Pro Asp His Arg Leu Glu Arg Ala. Ala Ser Arg Ala 

685 700 715 

AAC ACT GIG CGC AGC TTC CAC CAT GAA GAA TCT TTG GAA GAA CEA CCA GAA AGG 

Asn Dir Val Arg Ser Phe His His Glu Glu Ser Leu Giu Glu Leu Pro Glu Thr. 

50 
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730 745 760 775 

ACT GGG AAA ACA ACC OGG AGA TTC TIC TIT AAT TEA ACT TCT ATC CCC AOG GAG 

Ser Gly Lys /air Ihr Arg Arg Fhe Fhe Fhe Asn Leu Ser Ser lie Fro Thr Glu 

790 805 820 . 835 

GAG TIT ATC ACC TCA GCA GAG CIT CAG GIT TTC OGA GAA CAG ATS CAA GAT GCT 

Glu Hie He Thr Ser Ala ' Glu Leu Gin Val- Fhe Arg Glu Gin MET Gin Asp Ala 

850 865 . 880 

TEA GGA AAC AAT AGC ACT TTC CAT CAC CGA ATT AAT AIT TAT GAA ATC AIA AAA 
Leu Gly Asn Asn Ser Ser Hie His His Arg lie Asn He Tyr Glu He He Lys 

895 910 925 940 

CCT GCA ACA GOC AAC TOG AAA TTC CCC GIG ACC ACT CIT TIG GAC ACC AGG TTG 
Pro Ala air Ala Asn Ser Lys Fhe Pro Val Thr Ser leu Leu Asp air Arg Leu 

955 970 985 

GIG AAT CAG AAT GCA AGC AGG TGG GAA ACT TIT GAT GTC ACC CCC GCT GIG ATG 
Val Asn Gin Asn Ala Ser Arg TTp Glu Ser Hie Asp Val Thr Pro Ala Val MET 

1000 1015 1030 1045 

OGG TGG ACT GCA CAG GGA CAC' GOC AAC CAT GGA TTC GIG GIG GAA GIG GCC CAC 

Arg Tip Thr Ala Gin Gly His Ala Asn His Gly Phe Val Val Glu Val Ala His 

1060 1075 1090 H05 



TIG GAG 


GAG AAA CAA GCT 


GTC 


TCC AAG AGA 


CAT GIT AGG AIA AGC 


AGG TCT 


TIG 


Leu Glu 


Glu lys Gin Gly Val 


Ser Lys Arg 


His Val Arg He Ser 


Arg Ser 


Leu 






1120 






1135 






1150 








CAC CAA 


GAT GAA 


CAC AGC TGG 


TCA CAG 


ATA 


AGG CCA TIG CIA 


GTA 


ACT ITT 


GGC 


His Gin 


Asp Glu 


His ' Ser ' Trp 


Ser Gin 


lie 


Arg Pro Leu leu 


Val 


Thr Phe 


Gly 


1165 






1180 








1195 






1210 




CAT GAT 


GGA AAA 


GGG CAT 


CCT 


CTC 


CAC 


AAA 


AGA 


GAA AAA OCT 


CAA 


GCC 


AAA 


CAC 


His Asp 


Gly Lys 


Gl y His 


Pro 


Leu His 


lys 


Arg 


Glu Lys Arg 


Gin 


Ala 


Lys 


His 




1225 






. 1240 






1255 










AAA CAG 


CGG 


AAA 


OGC CIT 


AAG 


TCC 


AGC 


TCT 


AAG 


AGA CAC 


CCT 


TIG 


TAC 


GIG 


GAC 


Lys Gin 


Arg 


lys 


Arg Leu 


Lys 


Ser 


Ser 


cys 


Lys 


Arg His 


Pro 


Leu 


Tyr 


Val 


Asp 


1270 






1285 








1300 






1315 






TTC ACT 


GAC 


GIG 


GGG TGG 


AAT 


GAC 


TGG 


AIT GTC 


GCT CCC 


COG 


GGG 


TAT 


CAC 


GCC 


Phe Ser 


Asp 


Val 


Gly Trp 


Asn 


Asp 


Trp 


lie Val 


Ala Pro 


Pro 


Gly 


Tyr 


His 


Ala 


1330 






1345 








1360 








1375 


tit tac 


TGC 


CAC 


GGA GAA 


TGC 


CCT 


TIT 


CCT CTG 


GCT GAT 


CAT 


CIG 


AAC 


TCC ACT 


Fhe Tyr 


Cys 


His 


Gly Glu 


Cys. Pro 


Phe 


Pro Leu 


Ala Asp 


His 


leu 


Asn 


Ser Thr 






1390 






1405 






1420 








AAT CAT 


GOC 


AIT 


GIT CAG 


ACE TIG 


GTC AAC TCT 


GIT AAC 


TCT AAG 


■ATT 


CCT AAG 


Asn His 


Ala 


He 


Val Gin 


Thr Leu 


Val Asn Ser 


Val Asn 


Ser Lys 


lie 


Pro Lys 
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1435 1450 1465 1480 

C-O. TCC TCT GTC COG A CA GAA CTC ACT GOT ATC TOG ATC CTG TAC CTT GAC GAG 
5 Ala Cys Cys Val Pro Thr Glu Leu Sar Ala lie Sar MET Leu Tyr Leu Asp Glu 

1495 1510 1525 

AAT GAA AAG GIT CTA TIA AAG AAC TAT CAG GAC ATC GIT GTC GAG GGT TCT GGG 
Asn Glu Lys Val Val Leu Lys Asn Tyr Gin Asp MET Val Val Glu Gly Cys Gly 

W 1540(396) 1553 1563 1S73 1583 1593 1603 

TCP CGC TAGIACAGCA AAATEAAATA CATAAATATA TAIA5MAIA TATA ITIT AG AAAAAAGAAA 
Cys Arg 



15 AAAA 



[ 0069 ] Full-length hBMP-2Class II human cDNA clones are obtained in the following manner The 200 bp EcoRI-SacI 
20 fragment from the 5' end of the Class II recombinant 11-10-1 is isolated from its plasmid subclone, labeled by nick- 
translation, and hybridized to a set of duplicate nitrocellulose replicas of the U-2 OS cDNA library (25 filters/set; rep- 
resenting 500,000 recombinants). Hybridization and washing are performed under stringent conditions as described 
above. 16 duplicate positives are picked and repiated for secondaries. Nitrocellulose filter replicas of the secondary 
plates are made and hybridized to an oligonucleotide which was synthesized to correspond to the sequence of 11-10-1 
25 and is of the following sequence: 

' CGGGCGCTCAGGATACTCAAGACCAGTGCTG 

30 Hybridization is in standard hybridization buffer AT 50° C with washing at 50° in 1 X SSC, 0.1% SDS. 14 recombinant 
bacteriophage which hybridize to this oligonucleotide are plaque purified. Plate stocks are made and small scale bac- 
teriophage DNA preparations made. After sucloning 3 of these into Ml 3, sequence analysis indicates that they repre- 
sent clones which overlap the original Class II clone. One of these, lambda U20S-3, was deposited with the ATCC 
under accession number 40342 on June 16,1 987. U20S-3 contains an insert of approximately 1 .8 kb. The partial DNA 
35 sequence and derived amino acid sequence of U20S-3 are shown below in Table VIII. This clone is expected to contain 
all of the nucleotide sequence necessary to encode the entire human BMP-2 Class II protein. This cDNA contains an 
open reading frame of 1224 bp, encoding a protein of 408 amino acids, preceded by a 5' untranslated region of 394 
bp with stop codons in all frames, and contains a 3' untranslated region of 308 bp following the in-frame stop codon. 
The 8 bp region preceding the 5' untranslated region represents a linker used in the cDNA cloning procedure. This 
*0 protein of 408 amino acids has molecular weight of 47kd and is contemplated to represent the primary translation 
product. 
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TABLE VIII 

30 40 50 60 70 

AGGG.AGG GAAGGAGOGC GGAGCCCGGC CCGGAAGCIA CCTtLAGICTG 



10 



80 90 100 110 120 130 140 

GCATCOGAGC TC.4GGGACGC GAGOCIGAGA CGOOGCIGCT GCTCGGGCIG ACTAICIAGC TICTCTCCCC 

150 160 170 180 190 200 210 

GATGC-GAITC COCTOCAAGC TATCIOGAGC CIGCAGOGCC ACACTCCCCG GCCCIGGCOC AGGTICACIG 



15 220 230 240 . 250 260 270 280 

CAACCGTICA. GAGGTCCCCA C-GAGCTGCIG' CTGG02AGCC CGCEACTGCA GGGACCIA1G GAGCCA1TCC 



20 



290 300 310 320 330 340 350 

CTAGTCCCAT OCOGAGCAAC GCACIGCIGC AGCITCCCTS AGCCIHCCA GCAAGITIGT TCAAGATIGG 



25 



30 



35 



40 



45 



360 370 380 390 400 (1) 

CICTCAAGAA TCAIGGACIG TIATTAIAIG CCTIGITTTC TGTCAAGACA CC AIG ATT CCT 

.MET lie Pro 

417 432 447 462 

OCT AAC OGA AIG CDS AIG GTC GIT TIA TIA TGC CAA GTC CTG CIA GGA GGC GOG 
Gly Asn Arg MET Leu MET Val Val Lsu Leu Cys Gin Val leu Leu Gly Gly Ala 

477 492 507 

AGO CAT GCT ACT TIG AIA OCT GAG ACG GGG AAG AAA AAA GTC GCC GAG ATT CAG 
Ser His Ala Ser leu lie Pro Glu Cur Gly lys Lys Lys Val Ala Glu lie Gin 

522 537 552 567 

GGC CAC GOG GGA GGA OGC CGC TCA' GGG CAG AGC CAT GAG CTC CIG OGG GAC TTC 

Gly His Ala Gly Gly Arg Arg Ser Gly Gin Ser His Glu Leu Lsu Arg Asp Fhe 

582 • 597 612 621 

GAG C-OG ACA CIT CIG CAG AIG TIT GGG CIG CISC OGC OGC COG CAG CCT AGC AAG 
Glu Ala Cur Leu leu Gin MET Rie Gly Leu Arg Arg Arg Pro Gin Pro Ser lys 

642 657 672 

ACT GCC GTC AIT COG GAC TAC AIG CGG GAT CIT TAC OGG CIT CAG TCT GGG GAG 
Ser Ala Val lie Pro Asp iyr MET Arg Asp leu iyr Arg Leu Gin Ser Gly Glu 

, 687 702 717 732 

GAG GAG GAA GAG CAG ATC CAC AGC ACT GCT CIT GAG TAT CCT GAG OGC COG GCC 
Glu Glu Glu Glu Gin lie His Ser Cur Gly Leu Glu Tyr Pro Glu Arg Pro Ala 
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747 762 , 777 

AGC CGG GCC AAC ACC GIG AGG AGO TIC CAC CAC GAA GAA CAT CIG GAG AAC AIC 

Ser Arg Ala Asn Thr Val Arg Ser Fhe His His Glu Glu His Leu Glu Asn lie 

792 807 822 837 

CCA GGG ACC ACT GAA AAC TCI GCT TIT CGT TTC CTC TIT AA.C CIC AGC AGC ATC 

Pro Gly Thr Ser Glu Asn Ser Ala Fhe Arg Ehe Leu Ehe Asn Leu Ser Ser lie 

852 867 882 897 

CCT GAG AAC GAG GIG AIC TCC TCT GCA GAG CIT CGG CIC TTC CGG GAG CAG GIG 

Pro Glu Asn Glu Val He Ser Ser Ala Glu Leu Arg Leu Fhe Arg Glu Gin Val 

912 927 942 

GAC CAG GGC CCT GAT TGG GAA AGG GGC TTC CAC CGT ATA AAC AIT TAT GAG GIT ■ 

Asp Gin Gly Pro Asp Tip Glu Arg Gly Ehe His Arg lie Asn lie Tyr Glu Val 

957 972 987 1002 

ATG AAG CCC CCA GCA GAA GIG GIG OCT GGG CAC CTC ATC ACA CGA CIA CIG GAC 

MET Lys Pro Pro Ala Glu Val Val Pro Gly His leu He Thr Arg leu Leu Asp 

1017 1032 1047 

AOG AGA CIG GTC CAC CAC AAT GIG ACA CGG TGG GAA ACT TIT GAT GIG AGC CCT 

Thr Arg leu Val His His Asn Val Thr Arg Trp Glu Thr Phe Asp Val Ser Pro 

1062 1077 1092 1107 

C-OG GTC CIT CGC TGG ACC CGG GAG AAG CAG CCA AAC TAT GGG CIA GOC ATT GAG 

Ala Val Leu Arg Trp Thr Arg Glu Lys Gin Pro Asn Tyr Gly Leu Ala lie Glu 

H22 ' 1137 1152 H67 

GIG ACT CAC CTC CAT CAG ACT CGG ACC CAC CAG GGC CAG CAT GTC AGG AIT' AGC 

Val Thr His Leu His Gin Thr Arg Thr His Gin Gly Gin His Val Arg lie Ser 

1182 1197 1212 

CGA TOG TTA CCT CAA GGG ACT GGG AAT TGG GCC CAG CIC OGG CCC CTC CIG GIC 

Arg Ser Ifeu Pro Gin Gly Ser Gly Asn Trp Ala Gin leu Arg Pro Leu Leu Val 

1227 1242 1257 1272 

ACC TTI GGC CAT GAT GGC OGG C-GC CAT C-CC TIG ACC CGA CGC CGG AGG GCC AAG 

Thr Fhe Gly His Asp Gly Arg Gly His Ala Leu Thr Arg Arg Arg Arg Ala Lys 

1287 1302 1317 

CCT AGC CCT AAG CAT CAC TCA CAG OGG GCC AGG AAG AAG AAT AAG AAC TGC CGG 

Arg Ser Pro Lys His His Ser Gin Arg Ala Arg Lys Lys Asn Lys Asn Cys Arg 

1332 1347 1362 1377 

CGC CAC TOG CIC TAT GIG GAC TTC AGC GAT GIG GGC TGG AAT GAC TGG ATT GIG 

Arg His Ser Leu Tyr Val Asp Ehe Ser Asp Val Gly Trp Asn Asp Trp lie Val 

1392 1407 1422 1437 

GCC CCA CCA GGC TAC CAG GCC TTC TAC TGC CAT GGG GAC TGC CCC ITT CCA CIG 

Ala Pro Pro Gly Tyr Gin Ala Ehe Tyr Cys His Gly Asp Cys Pro Ehe Pro Leu 
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1452 1467 1482 

GCT GAC CAC CTC AAC TCA ACC AAC CAT GCC AIT GTG CAG ACC CIG GTC AAT TCI 
5 Ala Asp His Lsu Asn Ser Thr Asn His Ala lie Val Gin Thr Leu Val Asn Ser 

1497 1512 1527 1542 ' 

GTC AAT TCC ACT ATC CCC AAA GCC T GT? TCT GTG OCC ACT GAA CIG ACT GOC ATC 

Val Asn Ser Ser lie Pro Lys Ala Cys Cys Val Pro Hir Glu Leu Ser Ala lie 

10 1557 1572 1587 

TCC ATC CIG TAC CIG GAT GAG TAT GAT AAG GTC GTA CIG AAA AAT TAT CAG GAG 

Ser, MET Leu Tyr Leu Asp Glu Tyr Asp Lys Val Val Leu Lys Asn Tyr Gin Glu 

1602 1617 ( 408 ) 1636 1646 1656 

15 ATC GTA GIA GAG GGA TGT GGG TCC C GO TGAGATCAGG CAGTCCITGA GGAIAGACAG 

MET Val Val Glu Gly Cys Gly Cys Arg 

1666 1676 1686 1696 1706 1716 1726 

ATATACACAC CACACACACA CACCACATAC ACCACACACA CACGTICCCA TCCACICACC CACACACIAC 

20 

1736 1746 1756 1766 1776 1786 1796 

ACAGACIGCT TCCTEATAGC TCGACTITTA TTTAAAAAAA AAAAAAAAAA AATGGAAAAA ATOCCIAAAC 

1806 1816 1826 1836 1846 1856 1866 

ATTCACCTIG ACCTIATTIA TCACITTAOG TCCAAATCTT TIGACCATAT TCATCATATA TTTIGACAAA 

137 6 1886 1896 1906 1916 1926 1936 

30 - ATATATTIAT AACTAOGTAT TAAAAGAAAA AAATAAAATC AGTCATTATT TTAAAAAAAA AAAAAAAACT 

1946 

CEAGAGTOGA OGGAATIC 
35 



[0070] The sequences of BMP-2 Class I and II, as well as BMP-3 as shown in Tables III, IV, VII and VIII have significant 
40 homology to the beta (B) and beta (A) subunits of the inhibins. The inhibins are a family of hormones which are presently 
being investigated for use in contraception. See, A. J. Mason et al, Nature , 318:659-663 (1985). To a lesser extent 
they are also homologous to Mullerian inhibiting substance (MIS), a testicular glycoprotein that causes regression of 
the Mullerian duct during development of the male embryo and transforming growth factor-beta (TGF-b) which can 
inhibit or stimulate growth of cells or cause them to differentiate. Furthermore, the sequence of Table VII encoding 
45 hBMP-2 Class II has significant homology to the Drosophila decapentaplegic (DPP-C) locus transcript. See, J. Mas- 
sag ue, Cell, 49:437-438 (1987); R. W. Padgett et al, Nature, 325:81-84 (1987); R. L. Cate etal, Cell 45: 685-698 (1986). 
It is considered possible therefore that BMP-2 Class II is the human homolog of the protein made from this transcript 
from this developmental mutant locus. 

so C. BMP-3 

[0071] Because bovine and human bone growth factor genes are presumed to be significantly homologous, oligo- 
nucleotide probes which have been shown to hybridize to the bovine DNA sequence of Table IV.A and IV.B are used 
to screen a human genomic library A human genomic library (Toole et al., supra ) is screened using these probes, and 
55 presumptive positives are isolated and DNA sequence obtained as described above. Evidence that this recombinant 
encodes a portion of the human bone inductive factor molecule relies on the bovine/human protein and gene structure 
homologies. 

[0072] Once a recombinant bacteriophage containing DNA encoding a portion of the human BMP-3 molecule is 
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obtained the human coding sequence is used as a probe as described in Example V (A) to identify a human cell line 
or tissue which synthesizes BMP-3. mRNA is selected by oligo (dT) cellulose chromatography and cDNA is synthesized 
and cloned in lambda gtl 0 by established techniques (Toole et al., supra) . 

[0073] Alternatively, the entire gene encoding this human bone inductive factor can be identified and obtained in 
5 additional recombinant clones if necessary. Additional recombinants containing further 3’ or 5' regions of this human 
bone inductive factor gene can be obtained by identifying unique DNA sequences at the end(s) of the original clone 
and using these as probes to rescreen the human genomic library. The gene can then be reassembled in a single 
plasmid by standard molecular biology techniques and amplified in bacteria. The entire human BMP-3 factor gene can 
then be transferred to an appropriate expression vector. The expression vector containing the gene is then transfected 
10 into a mammalian cell, e.g. monkey COS cells, where the human gene is transcribed and the RNA correctly spliced. 
Media from the transfected cells are assayed for bone inductive factor activity as described herein as an indication 
that the gene is complete. mRNA is obtained from these cells and cDNA synthesized from this mRNA source and 
cloned. The procedures described above may similarly be employed to isolate other species' bone inductive factor of 
interest by utilizing the bovine bone inductive factor and/or human bone inductive factor as a probe source. Such other 
15 species' bone inductive factor may find similar utility in, inter alia, fracture repair. 

EXAMPLE VI 

Expression of Bone Inductive Factors. 

20 

[0074] In order to produce bovine, human or other mammalian bone inductive factors, the DNA encoding it is trans- 
ferred into an appropriate expression vector and introduced into mammalian cells by conventional genetic engineering 
techniques. 

[0075] One skilled in the art can construct mammalian expression vectors by employing the sequence of Tables 
25 ll-VIII or other modified sequences and known vectors, such as pCD [Okayama et al., Mol. Cell Biol. ', 2:161-170 (1982)] 
and pJL3, pJL4 [Gough et al., EMBO J., 4:645-653 (1985)]. The transformation of these vectors into appropriate host 
cells can result in expression of osteoinductive factors. One skilled in the art could manipulate the sequences of Tables 
ll-VIII by eliminating or replacing the mammalian regulatory sequences flanking the coding sequence with bacterial 
sequences to create bacterial vectors for intracellular or extracellular expression by bacterial cells. For example, the 
30 coding sequences could be further manipulated (e.g. ligated to other known linkers or modified by deleting non-coding 
sequences there-from or altering nucleotides therein by other known techniques). The modified bone inductive factor 
coding sequence could then be inserted into a known bacterial vector using procedures such as described in T. Tan- 
iguchi et al., Proc. Natl Acad. Sci. USA , 77:5230-5233 (1980). This exemplary bacterial vector could then be trans- 
formed into bacterial host cells and bone inductive factor expressed thereby. For a strategy for producing extracellular 
35 expression of bone inductive factor in bacterial cells., see, e.g. European patent application EPA 177,343. 

[0076] Similar manipulations can be performed for the construction of an insect vector [See, e.g. procedures de- 
scribed in published European patent application 155,476] for expression in insect cells. A yeast vector could also be 
constructed employing yeast regulatory sequences for intracellular or extracellular expression of the factors of the 
present invention by yeast cells. [See, e.g., procedures described in published PCT application W086/00639 and 
40 European patent application EPA 123,289]. 

[0077] A method for producing high levels of an osteoinductive factor of the invention from mammalian cells involves 
the construction of cells containing multiple copies of the heterologous bone inductive factor gene. The heterologous 
gene can be linked to an amplifiable marker, e.g. the dihydrofolate reductase (DHFR) gene for which cells containing 
increased gene copies can be selected for propagation in increasing concentrations of methotrexate (MTX) according 
45 to the procedures of Kaufman and Sharp, J. Mol. Biol. , 159:601-629 (1982). This approach can be employed with a 
number of different cell types. 

[0078] For example, a plasmid containing a DNA sequence for a bone inductive factor of the invention in operative 
association with other plasmid sequences enabling expression thereof and the DHFR expression plasmid pAdA26SV 
(A) 3 [Kaufman and Sharp, Mol. Cell. Biol. , 2:1304 (1982)] can be co-introduced into DHFR-deficient CHO cells, 
so DUKX-BII, by calcium phosphate coprecipitation and transfection. DHFR expressing transformants are selected for 
growth in alpha media with dialyzed fetal calf serum , and subsequently selected for amplification by growth in increasing 
concentrations of MTX (sequential steps in 0.02, 0.2, 1 .0. and 5uM MTX) as described in Kaufman et al., Mol Cell Biol. . 
5:1750 (1983). Transformants are cloned, and biologically active bone inductive factor expression is monitored by rat 
bone formation assay. Bone inductive factor expression should increase with increasing levels of MIX resistance. Sim- 
55 liar procedures can be followed to produce other bone inductive factors. 

[0079] Alternatively, the human gene is expressed directly, as described above. Active bone inductive factor may be 
produced in bacteria or yeast cells. However the presently preferred expression system for biologically active recom- 
binant human bone inductive factor is stably transformed CHO cells. 
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[0080] As one specific example, to produce the human bone inductive factor (hBMP-1) of Example V, the insert of 
U20S-1 is released from the vector arms by digestion with Sal I and subcloned into the mammalian expression vector 
pMT2CX digested with Xho I. Plasmid DNA from this subclone is transfected into COS cells by the DEAE-dextran 
procedure [Sompayrac and Danna PNAS 78:7575-7578 (1981); Luthman and Magnusson, Nucl.Acids Res . 11: 
5 1 295-1 308 (1 983)]. Serum-free 24 hr. conditioned medium is collected from the cells starting 40-70 hr. post-transfection. 

[0081] The mammalian expression vector pMT2 Cla-Xho (pMT2 CX) is a derivative of p91023 (b) (Wong et al., 
Science 228:810-815, 1985) differing from the latter in that it contains the ampicillin resistance gene in place of the 
tetracycline resistance gene and further contains a Xhol site for insertion of cDNA clones. The functional elements of 
pMT2 Cla-Xho have been described (Kaufman, R.J., 1985, Proc. Natl. Acad. Sci. USA 82:689-693) and include the 
io adenovirus VA genes, the SV40 origin of replication including the 72 bp enhancer, the adenovirus major late promoter 
including a 5' splice site and the majority of the adenovirus tripartite leader sequence present on adenovirus late mR- 
NAs, a 3' splice acceptor site, a DHFR insert, the SV40 early polyadenylation site (SV40), and pBR322 sequences 
needed for propagation in E. coli. 

[0082] Plasmid pMT2 Cla-Xho is obtained by EcoRI digestion of pMT2-VWF, which has been deposited with the 
15 American Type Culture Collection (ATCC), Rockville, MD (USA) under accession number ATCC 67122. EcoRI digestion 
excises the cDNA insert present in pMT2-VWF, yielding pMT2 in linear form which can be ligated and used to transform 
E. coli HB 1 01 or DH-5 to ampicillin resistance. Plasmid pMT2 DNA can be prepared by conventional methods. pMT2CX 
is then constructed by digesting pMT2 with Eco RV and Xbal, treating the digested DNA with Klenow fragment of DNA 
polymerase I, and ligating Cla linkers (NEBiolabs, CATCGATG). This removes bases 2266 to 2421 starting from the 
20 Hind ill site near the SV40 origin of replication and enhancer sequences of pMT2. Plasmid DNA is then digested with 
EcoRI, blunted as above, and ligated to.an EcoRI adapter, 

5 1 PO^-AATTCCTCGAGAGCT 3' 

25 

3 ' GGAGCTCTCGA 5 1 

digested with Xhol, and ligated, yielding pMT2 Cla-Xho, which may then be used to transform E. coli to ampicillin 
30 resistance. Plasmid pMT2 Cla-Xho DNA may be prepared by conventional methods. 

Example VII 

Biological Activity of Expressed Bone Inductive Factor 
35 

A. BMP-1 

[0083] To measure the biological activity of the expressed bone inductive factor (hBMP-1) obtained in Example VI 
above. The factor is partially purified on a Heparin Sepharose column. 4 ml of transfection supernatant from one 100 
40 mm dish is concentrated approximately 1 0 fold by ultrafiltration on a YM 1 0 membrane and then dialyzed against 20mM 
Tris, 0.15 M NaCI, pH 7.4 (starting buffer). This material is then applied to a 1.1 ml Heparin Sepharose column in 
starting buffer. Unbound proteins are removed by an 8 ml wash of starting buffer, and bound proteins, including BMP- 
1 , are desorbed by a 3-4 ml wash of 20 mM Tris, 2.0 M NaCI, pH 7.4. 

[0084] The proteins bound by the Heparin column are concentrated approximately 10-fold on a Centricon 10 and 
45 the salt reduced by diafiltration with 0.1 % trifiuoroacetic acid. The appropriate amount of this solution is mixed with 20 

mg of rat matrix and then assayed for in vivo bone and cartilage formation as previously described in Example III. A 
mock transfection supernatant fractionation is used as a control. 

[0085] The implants containing rat matrix to which specific amounts of human BMP-1 have been added are removed 
from rats after seven days and processed for histological evaluation. Representative sections from each implant are 
50 stained for the presence of new bone mineral with von Kossa and acid fuschin, and for the presence of cartilage- 

specific matrix formation using toluidine blue. The types of cells present within the section, as well as the extent to 
which these cells display phenotype are evaluated. 

[0086] Addition of human BMP-1 to the matrix material resulted in formation of cartilage-iike nodules at 7 days post 
implantation. The chondroblast-type cells were recognizable by shape and expression of metachromatic matrix. The 
55 amount of activity observed for human BMP-1 was dependent upon the amount of human BMP-1 protein added to the 
matrix. Table IX illustrates the dose-response relationship of human BMP-1 protein to the amount of bone induction 
observed. 
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Table IX 



IMPLANT NUMBER 


AMOUNT USED 

(equivalent of ml transfection media) 


HISTOLOGICAL SCORE 


876-134-1 


10 BMP-1 


. C+2 


876-134-2 


3 BMP-1 


C+1 


876-134-3 


1 BMP-1 


c +/- 


876-134-4 


10 MOCK 


c- 


876-134-5 


3 MOCK 


C- 


876-134-6 


1 MOCK 


C- 



Cartilage (c) activity was scored on a scale from O(-) to 5. 

[0087] Similar levels of activity are seen in the Heparin Sepharose fractionated COS cell extracts. Partial purification 
is accomplished in a similar manner as described above except that 6 M urea is included in all the buffers. Further, in 
a rat bone formation assay as described above, BMP-2 has similarly demonstrated chondrogenic activity. 

[0088] The procedures described above may be employed to isolate other bone inductive factors of interest by uti- 
lizing the bovine bone inductive factors and/or human bone inductive factors as a probe source. Such other bone 
inductive factors may find similar utility in, inter alia, fracture repair. 

[0089] The foregoing descriptions detail presently preferred embodiments of the present invention. Numerous mod- 
ifications and variations in practice thereof are expected to occur to those skilled in the art upon consideration of these 
descriptions. Those modifications and variations are believed to be encompassed within the claims appended hereto. 

Claims 

1. A gene encoding human BMP-1 comprising the following DNA sequence: 
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15 
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10 20 30 (1) 50 

crftGAGGcas crroocroGC asocGcooas ocagc atc ax ggc gtg gcc cgc cig oas 

MSI Pro Gly Val Ala. -tog leu Pro 

65 80 95 no 

CTC COE CTC GGG GG CTO CDS CTC CCS CSS CCC GGC GGG OQrc CIG G&C TIG GCC 

leu leu leu Gly Leu Lsu lsu leu Pro Arg Pro Gly Arg Pro Lai Asp Leu Ala 

125 KO 155 

GAC T*C ACC TAT ©C CTC GOG <3>£ ®*G GAC GAC TCG GAG CCC CIC AAC T AC AAA 

Asp lyr Bjr Tyr Asp leu Ala Glu Glu Asp Asp Ser Glu pro leu Asn Tyr LyB 

170 185 200 215 

GAC OCC TCC AAG GCS GCT GCC TIT CTT GGG GAC AIT GGC CTC GAC GAA <3\G GAC 

Asp Pro cys lys Ala Ala Ala Ifee leu Gly Asp lie Ala Lsu Asp Glu Glu Asp 

230 245 260 275 

CTC AGG GCC TIC CA G GJA CAG C AG GCT GTG GAT CTC AGA OGG CSC ACA GCT COT 

Zeu Arg Ala Hie Gin Val Gin Gin Ala Val Asp Leu Arg Arg His Our Ala Arg 

. 290 305 320 

" AAG TCT TOC -ASC -ASA GOT CCA GIT OCA "GtSV AAC ACT TCT ACC CCC 'JSC TCC GSG" 

Lys Sear Ser He lys Ala Ala Val Pro Gly Asn *Dir Ser Thr Pro Ser Cys Gin 

335 350 •• ‘ ■"*' 365 380 ’ " 

AGCACCAACGCSGCAGOCTCAGAGGGGAGOCTCr GGG AGA TCG AGA GCT AflA TCC 
Ser Hir Asn Gly Gin Pro Gin Arg Gly Ala Cys Gly Arg Trp Arg Gly Arg Ser 



395 410 425 

OGT AGC CGG OGG GCG GOG ACG TCC OG& CCA GAG C3T GTG TCG OCC GAT GGG GTC 

Arg Ser Arg Arg Ala Ala ttir Ser Arg Ko Glu Arg Val ®ep Pro Asp Gly Vol 

440 455 470 485 

AIC OCC TIT GTC AST GGG GGA AAC TIC ACT GGT AGC CAS AGG GCA GTC TIC 

lie pro PSie Val He Gly Gly Asn Hie Bir Gly Ser Gin Arg Ala Val She 



500 

CAG G0C ATC 
Gin Ala HET 



515 530 

AGG CA C TCG GAG AAC CAC ACC TCI CTC ACC TIC CTC GAG OGC 
Arg His Tro Glu IysHisTCrCysV&lTCrRieLeu Glu Arg 



560 573 590 

GAC GAG GAC AGC TAT ATT GTG TTC ACC TAT OGA OCT TCC GGG TCC TCC TOC 

^SSSS^SvalB»ttiiyr«9iwcy 5 =lyCimcv.s«: 

620 535 fSQ 

^v-^^rnr.aaiGGGGGCOCCCAGGCCATCTOCAICGGCAAGAACTCrGAC 

SSSSSSSKoGlnAlalleSerne cly Lys Asn Cys Asp 



50 



55 



30 



XX. XBX 18 
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€65 680 695 

5 AflG TTC GGC AIT GTO GIC GAC OC CR> GGC CAC OK QIC GGC TTC TOG CAC GAA 

Iys She Gly Ha Val Val His dn Lea Gly His V&l v&l Gly Eia Tip His du 

710 725 740 755 

CAC ACT CEG CCA OC CGG GAC CSC CSC GTT TOC ATC GTT CCT’d&’iAAC ASC CAS 

w His Thr Arg Pro Asp Arg Asp Arg Els Vtol Ser Ha val Jag Glu Asn He dn 

770 785 800 815 

CCA 033 (30 SAG TAT AAC TTC CDS ASO JOG GAS OCT CAS GAG GIG GAG TOC CIO 

Pro Gly dn Glu Tyr Asn Ifca Leu Iys met du Pro da Glu Val Glu ser Zsu 

15 830 345 860 

GGG GAG AOC ®T GAC TIC ®C AGC ASS ASO <3t T T&C GCT OGS AAC ACA TTC TOC 

Gly dn Thr Tyr A^) Rsa Asp Set He MSP His Tyr Ala Arg Asn Bar She Ser 



875 890 905 920 

20 AGO GGC AOC TTC C3G GOT ACC JOT GSC GDC AAS TAT CAS GTS AAC GOO GTS AAA 

Arg Gly Ha Rte leu Asp Thr Ha v&l Pro iys Tyr Gn V&l Asn Gly val Iys 



935 950 965 

OCT COC ATT OSC CAA ASS ACA CSC CSC AGC MB GOG GAC AIT GOC CAA GOC OGC 
ft® XLa Gly Gla Arg Tbr Arg lea Ser Iys Gly Asp He Ala da AlaAig 

980 995 1010 1025 





AAC 


err 


me 


MG 


TOC 


OCA 


GOC. 


TOT 


GC a 


GAG 


ACC CTO 


CAA GAC AGC 


ACA GSC AAC 




Iys 


Lsu 


Tyr 


Iys 


eye 


ft® 


Ala 


cys 


Gly 


du 


Thr Leu 


can A^s-Ssr Thr Gly Asn. 


30 




1040 








1055 






1070 


1085 




TTC 


TOC 


TOC 


OCT 


GAA 


me 


ccc 


AAT 


GGC 


me 


TOT GCT 


CAC ATO CAC 


TOC CTO TBS 




Rue 


Sar 


Ser 


ft® 


du 


Tyr 


Fro 


Asn 


dy 


Tyr 


Ser Ala 


His HEP His 


Cys Val mp 










1100 








1115 




1130 


TOC CTO GAC 


35 


OGC 


A3C 


TOT 


GIC 


ACA 


ccc 


GGG 


GAG 


AAG 


use 


AIC CTO 


AAC TTC ACS 




Arg 


He 


Ser 


vai 


2hr. 


pro 


dy 


GlU 


Iys 


He 


lie Lsu 


Asn She Thr 


Ser leu Aep 




1145 








1160 








1175 




1190 




CIS 


TAC 


OSC 


AGC 


GGC 


CTO 


TOC 


TOG 


me 


GAC 


TAT CTO 


GAG GIC CSA 


GAT GGC TTC 


40 


Leu 


iyr 


Arg 


Ser 


Arg 


I ax 


<ys 


Trp 


Tyr 


Asp 


Tyr val 


Glu val Arg 


Asp Gly She 








12 OS 








1220 






1235 


OCT GAG OCT 




.TOO 


AGS 


J&G 


GOS 


ooc 


CSC 


OGA 


GSC 


OGC 


TTC 


TOC GGG 


TOC AAA CXC 




Trp 


Arg 


lya 


Ala 


Pro 


Lou 


Arg 


dy 


Arg 


Hia 


cys Gly 


Ser iys Leu 


Pro du Rco 


45 


1250 

rrrr 


TOC 


ACT 


GAC 


1265 

AGC 


OOC 


CSC 


TOG 


GTT 


1280 
GAA TTC 


OGC AGC AGC 


1295 

AGC AAT TOT 




lie 


Vial 


Ser 


Thr 


Asp 


Ser 


Arg 


Leu 


Tip 


Val 


Glu His 


Arg Ser Ser 


Ser Asn Trp 



50 



1310 

GTT SGA AAG GGC TTC 
val Gly Iys Gly Hie 



1325 1340 
ITT GCA GIC ZAC GAA GOC ADC TGC 
a ja Ala Val Tyr Glu Ala He Cys 



1355 

GGG GOT GAT GIG AAA 
Gly Gly Asp val iys 



55 



31 
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15 



20 
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45 
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55 



1370 1385 1400 

AAG GAC TAT GGC CAC ATT CAA TOG COG AAC TAC CCA GAC GAIT TAC CBS COC ACC 
lys Asp Tyr Gly Hie Us Gin Car Pso Asn Tyr Pro Asp Asp Tyr Arg Pro Ser 

1415 1430 1445 - 4 '* 1460 

AAA CSC TOC ATC TOG CGG ATC OG GTO TOT GAG GGC TTC CSC GZG GGC CTO AC& 
lys Val Cys He lip Arg He Gin V&L Ser Glu Gly She His Val Gly Leu dr 

1475 1490 1505 

TTC CM2 TOC TIT GAG ATT GAG GGC CAC GAC AGO TGT GGC T AC G3C TAT CPS GAS 
Hu Gin Sear Rib Glu He Gin Arg His Asp Ser Cys Ala Tyr Asp lyr Lai Glu 

1520 1535 1550 1565 

GTO OSC GAC GGG CAC ACT <3G AGC AGC ADC CTO ATC GGS CGC TAC TOT GGC TAT 

val Arg Asp Gly His Ser Gin Ser Ser dr Lsn He Gly Arg Tyr Cys Gly Tyr 

1580 1593 1610 1625 

GAG AAG OCT GAT GAC.ATO AAG AGC PCS TOC AGC CSC CTO TOG CTO AAG TTC GTO 
can lys Pro Asp Asp He lye Sar dr Ser Ser Arg leu Tsp leu lys F&a Val 

1640 1655 1670 

TOT GAC GGG TOC ATT AAC AAA GOG GGC ITT GCC OTC AAC TTT TTC AAA GAG GIG 
Ser Asp Gly Ser He Asn ly^ Ala Gly Bus Ala Val ten Rib Ffee Lys Glu .Val 

1685 1700 1715 1730 

GAC GAG .TOC TOT CGS OOC AAC CSC. G93 G3C TJsrjfflG. CAS COG TOC CXC AAC ACC 

Asp Glu Cy s Ser" Arg Pro to aigdyfly Cys Glu Gin Arg - Cys Lai Asn 'dr 

1745 1760 1775 

CIS QBC AGC TAC AAG TOC AGC TOT GAC OOC GGS TAC GAG CTO GCC CCA GAC AAG 
Lsu Gly Ser Tyr lys Cys Ser cys Asp Pro Gly Tyr Glu leu Ala Pro Asp lys 

1790 1805 1820 1835 

CG C CGC TOT <afl GCT GCX TOT GGC GSV TTC CTO ACC AAG CXC AAC GGC TOC ATC 
Arg Arg Cys Glu Ala Ala cys Gly Gly Fbe leu dr lys leu Asn dy Ser He 



1850 1865 I860 1895 

ACC AGC OOG GGC TOG OOC AAG GAG TAC OCC OCC AAC AAG AAC TOC AGC TGG CAG 

Tbr Ser Fro Gly Tip Pto lys Glu Tyr B® B® Asn lye Asn Cys He dp Gd 



1910 1925 1940 

CTO GIG GOC OOC ACC CAG TAC OGC ATC TOC CTO CM3 TOT GAC TOC TOT GAG ACA 
Lai val Ala Pro dr Gin Tyr Arg He ser leu Gin R» Psp R» Rua Glu dr 



. yvjn 1985 2000 

GAG GGC AKT GAT GIG TOC AAG TAC ®C TIC GIG GAG GIG OGC AGT ^ 

SS S Sn Asp vS cys lys Tyr Asp da val Glu val Arg ser dy urn dr 



2015 

GOT GAC TCC AAG 
Ala Asp ser lys 



2030 

CTO CAT GGC AAG TTC TOT GGT TOT 
leu aie Gly lys Cys Gly Ser 



2045 

CAG AAG C0C GAG GTC ATC 
Glu lys Pro Glu val lie 



32 
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5 2060 2075 2090 210S 

ACC TOC CAS TfcC AAC AAC MX? CSC GIO OS TIC MG TOC GAC MC ACC GIG TOC 
Uir Sar dn iyr Asn Aan MET Arg Val Glu ifce I#b Sar Asp Asn Thr V&l Ser 

2120 2125 2150 2165 

io AAA AAfi G3C TIC AAG GCC CAC TIC TOC TCA (5\A AAG AGS CCA GCT. CXJS CMS CO C 
lys lys dy Eta lys Ala His Eba Ete Ser Glu lys Azg Pro Ala Lea Gin Pro 

2180 2195 2210 

OCT ass CSS OSC COC CAC CM CEO AAA SIC OSA GIB CAS AAA AGA AAC 0SG ACC 
, 5 Pro Azg Gly Azg Pro His dn Leu lys jfca Arg Val Gin lys Azg Asn Azg lhr 
(730) 

2235 2235 2245 22SS 2265 2275 2285 

COC CAS TB ft QO OCT QC CaaSO CT C C C GSaC OOC T TO TSACTCAGSA ACCTCACCTT GSACSGAATO 
ao Gin 

20 2295 2305 2315 2325 2335 2345 2355 

GGATOGGGGC TTOOG B SCOC ACCAACOOOC CAOCICCSCT CTGOCASTOC GSOOCAOCTO OCTCT B GOCS 



2365 2375 2385 2395 2405 2415 2425 

25 StfaGAAOG GTXTC XCT r CTOGCTACIG TCCOCGTO0S CBGAO0SQ 3G A OO CITOOO C GIGC0C3ACC 



• . - - i ----- - -v • . " ' . . 

2435 2445 2455 2465 2475 24S5 2495 

CCCTOCCMT TOMBCTCT CW&EAaXT TCCKl TICT S CSSS^apCI G CCIOCTMCT . 






35 2. A gene encoding human BMP.-1 having the amino acid sequence given in claim 1 . 

3. A gene encoding a protein exhibiting properties of BMP-1 and comprising DNA sequences: 

(a) which differ from a DNA sequence of claim 1 in codon sequence due to the degeneracy of the genetic code; 
4 o (b) which hybridise with a DNA sequence of claim 1 or section (a), above; or 

(c) represent fragments, allelic or other variations of a DNA sequence of claim 1 , whether said variations result 
in changes in the peptide sequence or not. 

4. The DNA sequence of claim 3, which is a genomic DNA sequence. 

45 

5. The DNA sequence of claim 3, which is a cDNA sequence. 

6. A gene encoding bovine BMP-1 comprising the following DNA sequence: 

50 



55 
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280 290 (1) 308 323 

ocittocxiict Tcrcierccfc gct gcc rac cut gqg <ac axc ooc cio ®c cm gag 

_Xla Ala I/ai Act GliiGlu 

338 353 ** 368 

<aC 1TO AGG <3QC TIC CAA CTO CMS CAG CCS GOG G££ CIC AGA CAG OGT GCA ACC 

Arq Ala. Phfi GXn Val CSln dn Ala Ala Aap-laa Ara Gin Aq JOa 

383 398 (37) 414 424 

OGC AOTTCT TOC JOC AAA GCT GCA GCTACACXQ3 GEACAGGCC& 

Arg Arg Ser S&r Ha Iys Ala Ala 

7. A gene encoding bovine BMP-1 containing the amino acid sequence of claim 6. 

8. A gene encoding a protein exhibiting properties of bovine BMP-1 and comprising DNA sequences: 

(a) which differ from a DNA sequence of claim 6 in codon sequence due to the degeneracy of the genetic code; 

(b) which hybridise with a DNA sequence of claim 6 or section (a), above; or 

(c) represent fragments, allelic or other variations of a DNA sequence of claim 6, whether said variations result 
in changes in the peptide sequence or not. 

9. The DNA sequence of claim 8, Which is a genomic DNA sequence. 



10. The DNA sequence of claim 8, which is a cDNA sequence. 



11. A vector containing the gene or DNA sequence of any one of claims 1 to 10 in operative association with an 
expression control sequence. 

12. A cell transformed with a vector of claim 11 . 

13. The cell of claim 12 which is a mammalian cell, a bacterial cell, an insect cell or a yeast cell. 

14. A protein exhibiting properties of BMP-1 which is encoded by the gene or DNA sequence of any one of claims 1 
to 10. 



15. A process for producing the protein of claim 14, comprising the steps of culturing in a suitable culture medium the 
cell of claim 13 and isolating said protein from said culture medium. 

16. A pharmaceutical composition comprising the protein of claim 14 and a pharmaceutically acceptable vehicle. 

17. The pharmaceutical composition of Claim 16, further comprising a matrix capable of delivering the composition to 
the site of the bone or cartilage defect and providing a structure for inducing bone or cartilage formation. 

18. The pharmaceutical composition of claim 17, wherein said matrix comprises hydroxyapatite, collagen, polylactic 
acid or tricalcium phosphate. 

19. Use of a protein of claim 14 for the preparation of a pharmaceutical composition for inducing bone or cartilage 
formation. 
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